FIGURE 25-9 Nick translation. In this process, an RNA or DNA strand
paired to a DNA template is simultaneously degraded by the 5'→3'
exonuclease activity of DNA polymerase I and replaced by the
polymerase activity of the same enzyme. These activities have a role
in both DNA repair and the removal of RNA primers during replication
(both described later). The strand of nucleic acid to be removed
(either DNA or RNA) is shown in green, the replacement strand in red.
DNA synthesis begins at a nick (a broken phosphodiester bond, leaving
a free 3' hydroxyl and a free 5' phosphate). Polymerase I extends the
nontemplate DNA strand and moves the nick along the DNA, a process
called nick translation. A nick remains where DNA polymerase I
dissociates, and is later sealed by another enzyme.
FIGURE 25-1 Map of the E. coli chromosome. The map shows the relative
positions of genes encoding many of the proteins important in DNA
metabolism. The number of genes known to be involved provides a hint
of the complexity of these processes. The numbers 0 to 100 inside the
circular chromosome denote a genetic measurement called minutes. Each
minute corresponds to ~40,000 bp along the DNA molecule of E. coli.
The three-letter names of genes and other elements generally reflect
some aspect of their function. These include mut, mutagenesis; dna,
DNA replication; pol, DNA polymerase; rpo, RNA polymerase; uvr, UV
resistance; rec, recombination; dam, DNA adenine methylation; lig,
DNA ligase; Ter, termination of replication; and ori, origin of
replication.


A Word about Terminology Before beginning to look closely at
replication, we must make a short digression into the use of
abbreviations in naming genes and proteins. By convention, bacterial
genes generally are named using three lowercase, italicized letters
that often reflect their apparent function. For example, the dna,
uvr, and rec genes affect DNA replication, resistance to the damaging
effects of UV radiation, and recombination, respectively. Where
several genes affect the same process, the letters A, B, C, and so
forth, are added (as in dnaA, dnaB, dnaQ, for example), usually
reflecting their order of discovery rather than their order in a
reaction sequence.

During genetic investigations, the protein product of each gene is
usually isolated and characterized. Many bacterial genes have been
identified and named before the roles of their protein products are
understood in detail. Sometimes the gene product is found to be a
previously isolated protein, and some renaming occurs. Often the
product turns out to be an as yet unknown protein, with an activity
not easily described by a simple enzyme name. In a practice that can
be confusing, these bacterial proteins often retain the name of their
genes. When referring to the protein, roman type is used and the
first letter is capitalized: for example, the dnaA and recA gene
products are called the DnaA and RecA proteins, respectively. You
will encounter many such examples in this chapter.

Similar conventions exist for the naming of eukaryotic genes,
although the exact form of the abbreviations may vary with the
species and no single convention applies to all eukaryotic systems.


25.1 DNA Replication 

Long before the structure of DNA was known, scientists wondered at
the ability of organisms to create faithful copies of themselves and,
later, at the ability of cells to produce many identical copies of
large and complex macromolecules. Speculation about these problems
centered around the concept of a template, a structure that would
allow molecules to be lined up in a specific order and joined, to
create a macromolecule with a unique sequence and function. The 1940s
brought the revelation that DNA was the genetic molecule, but not
until James Watson and Francis Crick deduced its structure did the
way in which DNA could act as a template for the replication and
transmission of genetic information become clear: one strand is the
complement of the other. The strict base-pairing rules mean that each
strand provides the template for a sister strand with a predictable
and complementary sequence (see Figs 8-16, 8-17).

The fundamental properties of the DNA replication 
process and the mechanisms used by the enzymes that 
catalyze it have proved to be essentially identical in all 
species. This mechanistic unity is a major theme as we 
proceed from general properties of the replication 
process, to E. coli replication enzymes, and, finally, to 
replication in eukaryotes. 

DNA Replication Follows a Set of Fundamental Rules 

Early research on bacterial DNA replication and its enzymes helped to
establish several basic properties that have proven applicable to DNA
synthesis in every organism.

DNA Replication Is Semiconservative Each DNA strand 
serves as a template for the synthesis of a new strand, 
producing two new DNA molecules, each with one new 
strand and one old strand. This is semiconservative 
replication. 

Watson and Crick proposed the hypothesis of semiconservative
replication soon after publication of their 1953 paper on the
structure of DNA, and the hypothesis was proved by ingeniously
designed experiments carried out by Matthew Meselson and Franklin
Stahl in 1957. Meselson and Stahl grew E. coli cells for many
generations in a medium in which the sole nitrogen source (NH₄Cl)
contained ¹⁵N, the “heavy” isotope of nitrogen, instead of the
normal, more abundant “light” isotope, ¹⁴N. The DNA isolated from
these cells had a density about 1% greater than that of normal
[¹⁴N]DNA (Fig. 25-2a). Although this is only a small difference, a
mixture of heavy [¹⁵N]DNA and light [¹⁴N]DNA can be separated by
centrifugation to equilibrium in a cesium chloride density gradient.

The E. coli cells grown in the ¹⁵N medium were transferred to a fresh
medium containing only the ¹⁴N isotope, where they were allowed to
grow until the cell population had just doubled. The DNA isolated
from these first-generation cells formed a single band in the CsCl
gradient at a position indicating that the double-helical DNA
molecules of the daughter cells were hybrids containing one new ¹⁴N
strand and one parent ¹⁵N strand (Fig. 25-2b).

FIGURE 25-2 The Meselson-Stahl experiment. (a) Cells were grown for
many generations in a medium containing only heavy nitrogen, ¹⁵N, so
that all the nitrogen in their DNA was ¹⁵N, as shown by a single band
(blue) when centrifuged in a CsCl density gradient. (b) Once the
cells had been transferred to a medium containing only light
nitrogen, ¹⁴N, cellular DNA isolated after one generation
equilibrated at a higher position in the density gradient (purple
band). (c) Continuation of replication for a second generation
yielded two hybrid DNAs and two light DNAs (red), confirming
semiconservative replication.

This result argued against conservative replication, an alternative
hypothesis in which one progeny DNA molecule would consist of two
newly synthesized DNA strands and the other would contain the two
parent strands; this would not yield hybrid DNA molecules in the
Meselson-Stahl experiment. The semiconservative replication
hypothesis was further supported in the next step of the experiment
(Fig. 25-2c). Cells were again allowed to double in number in the ¹⁴N
medium. The isolated DNA product of this second cycle of replication
exhibited two bands in the density gradient, one with a density equal
to that of light DNA and the other with the density of the hybrid DNA
observed after the first cell doubling.
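
The band patterns expected under the two hypotheses can be tallied
with a short sketch. It is only an illustration: the strand labels
"H" (¹⁵N) and "L" (¹⁴N) and the function names are assumptions made
here, and each duplex is tracked as a pair of strand labels starting
from a single fully heavy molecule.

```python
from collections import Counter

def semiconservative(generations):
    """Count duplex types after n doublings in light (14N) medium."""
    duplexes = [("H", "H")]              # fully heavy parental DNA
    for _ in range(generations):
        next_gen = []
        for a, b in duplexes:
            # Each parent strand templates one new light strand.
            next_gen.append((a, "L"))
            next_gen.append((b, "L"))
        duplexes = next_gen
    return Counter("".join(sorted(d)) for d in duplexes)

def conservative(generations):
    """Same count under the rejected conservative hypothesis."""
    duplexes = [("H", "H")]
    for _ in range(generations):
        next_gen = []
        for d in duplexes:
            next_gen.append(d)             # parental duplex stays intact
            next_gen.append(("L", "L"))    # progeny duplex is entirely new
        duplexes = next_gen
    return Counter("".join(sorted(d)) for d in duplexes)

print(semiconservative(1))   # one band: hybrid only
print(semiconservative(2))   # two bands: hybrid and light
print(conservative(1))       # would give heavy and light, never hybrid
```

Under the semiconservative model the first generation contains only
hybrid ("HL") duplexes, whereas the conservative model never produces
a hybrid, which is the distinction the density gradient resolves.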

Replication Begins at an Origin and Usually Proceeds Bidirectionally
Following the confirmation of a semiconservative mechanism of
replication, a host of questions arose. Are the parent DNA strands
completely unwound before each is replicated? Does replication begin
at random places or at a unique point? After initiation at any point
in the DNA, does replication proceed in one direction or both?

An early indication that replication is a highly coordinated process
in which the parent strands are simultaneously unwound and replicated
was provided by John Cairns, using autoradiography. He made E. coli
DNA radioactive by growing cells in a medium containing thymidine
labeled with tritium (³H). When the DNA was carefully isolated,
spread, and overlaid with a photographic emulsion for several weeks,
the radioactive thymidine residues generated “tracks” of silver
grains in the emulsion, producing an image of the DNA molecule. These
tracks revealed that the intact chromosome of E. coli is a single
huge circle, 1.7 mm long. Radioactive DNA isolated from cells during
replication showed an extra loop (Fig. 25-3a). Cairns concluded that
the loop resulted from the formation of two radioactive daughter
strands, each complementary to a parent strand. One or both ends of
the loop are dynamic points, termed replication forks, where parent
DNA is being unwound and the separated strands quickly replicated.
Cairns’s results demonstrated that both DNA strands are replicated
simultaneously, and a variation on his experiment (Fig. 25-3b)
indicated that replication of bacterial chromosomes is bidirectional:
both ends of the loop have active replication forks.

FIGURE 25-3 Visualization of bidirectional DNA replication.
Replication of a circular chromosome produces a structure resembling
the Greek letter theta (θ). (a) Labeling with tritium (³H) shows that
both strands are replicated at the same time (new strands shown in
red). The electron micrographs illustrate the replication of a
circular E. coli plasmid as visualized by autoradiography. (b)
Addition of ³H for a short period just before the reaction is stopped
allows a distinction to be made between unidirectional and
bidirectional replication, by determining whether label (red) is
found at one or both replication forks in autoradiograms. This
technique has revealed bidirectional replication in E. coli, Bacillus
subtilis, and other bacteria.

The determination of whether the replication loops originate at a
unique point in the DNA required landmarks along the DNA molecule.
These were provided by a technique called denaturation mapping,
developed by Ross Inman and colleagues. Using the 48,502 bp
chromosome of bacteriophage λ, Inman showed that DNA could be
selectively denatured at sequences unusually rich in A=T base pairs,
generating a reproducible pattern of single-strand bubbles (see
Fig. 8-31). Isolated DNA containing replication loops can be
partially denatured in the same way. This allows the position and
progress of the replication forks to be measured and mapped, using
the denatured regions as points of reference. The technique revealed
that in this system the replication loops always initiate at a unique
point, which was termed an origin. It also confirmed the earlier
observation that replication is usually bidirectional. For circular
DNA molecules, the two replication forks meet at a point on the side
of the circle opposite to the origin. Specific origins of replication
have since been identified and characterized in bacteria and lower
eukaryotes.

DNA Synthesis Proceeds in a 5'→3' Direction and Is Semidiscontinuous
A new strand of DNA is always synthesized in the 5'→3' direction,
with the free 3' OH as the point at which the DNA is elongated (the
5' and 3' ends of a DNA strand are defined in Fig. 8-7). Because the
two DNA strands are antiparallel, the strand serving as the template
is read from its 3' end toward its 5' end.
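
The directionality rule can be made concrete with a minimal sketch.
The function name and the convention of writing every sequence as a
5'→3' string are assumptions made for illustration; the code simply
walks the template from its 3' end and appends each complementary
base to the growing 3' end of the new strand.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def synthesize(template_5to3):
    """Return the new strand (written 5'->3') for a template written 5'->3'."""
    new_strand = []
    # The template is read from its 3' end toward its 5' end,
    # so we iterate over the 5'->3' string in reverse.
    for base in reversed(template_5to3):
        new_strand.append(COMPLEMENT[base])   # added at the growing 3' end
    return "".join(new_strand)

print(synthesize("ATGC"))
```

Because the strands are antiparallel, the product written 5'→3' is
the reverse complement of the template written 5'→3'.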

If synthesis always proceeds in the 5'→3' direction, how can both
strands be synthesized simultaneously? If both strands were
synthesized continuously while the replication fork moved, one strand
would have to undergo 3'→5' synthesis. This problem was resolved by
Reiji Okazaki and colleagues in the 1960s. Okazaki found that one of
the new DNA strands is synthesized in short pieces, now called
Okazaki fragments. This work ultimately led to the conclusion that
one strand is synthesized continuously and the other discontinuously
(Fig. 25-4). The continuous strand, or leading strand, is the one in
which 5'→3' synthesis proceeds in the same direction as replication
fork movement. The discontinuous strand, or lagging strand, is the
one in which 5'→3' synthesis proceeds in the direction opposite to
the direction of fork movement. Okazaki fragments range in length
from a few hundred to a few thousand nucleotides, depending on the
cell type. As we shall see later, leading and lagging strand
syntheses are tightly coordinated.

FIGURE 25-4 Defining DNA strands at the replication fork. A new DNA
strand (red) is always synthesized in the 5'→3' direction. The
template is read in the opposite direction, 3'→5'. The leading strand
is continuously synthesized in the direction taken by the replication
fork. The other strand, the lagging strand, is synthesized
discontinuously in short pieces (Okazaki fragments) in a direction
opposite to that in which the replication fork moves. The Okazaki
fragments are spliced together by DNA ligase. In bacteria, Okazaki
fragments are ~1,000 to 2,000 nucleotides long. In eukaryotic cells,
they are 150 to 200 nucleotides long.

DNA Is Degraded by Nucleases

To explain the enzymology of DNA replication, we first introduce the
enzymes that degrade DNA rather than synthesize it. These enzymes are
known as nucleases, or DNases if they are specific for DNA rather
than RNA. Every cell contains several different nucleases, belonging
to two broad classes: exonucleases and endonucleases. Exonucleases
degrade nucleic acids from one end of the molecule. Many operate in
only the 5'→3' or the 3'→5' direction, removing nucleotides only from
the 5' or the 3' end, respectively, of one strand of a
double-stranded nucleic acid or of a single-stranded DNA.
Endonucleases can begin to degrade at specific internal sites in a
nucleic acid strand or molecule, reducing it to smaller and smaller
fragments. A few exonucleases and endonucleases degrade only
single-stranded DNA. There are a few important classes of
endonucleases that cleave only at specific nucleotide sequences (such
as the restriction endonucleases that are so important in
biotechnology; see Chapter 9, Fig. 9-3). You will encounter many
types of nucleases in this and subsequent chapters.
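
The difference between the two classes can be sketched on a single
strand written 5'→3'. This is a toy model, not a description of any
particular enzyme: the function names are hypothetical, and the
GAATTC site with a cut after its first base is an illustrative choice
(it matches the well-known EcoRI site) rather than something taken
from the text.

```python
def exonuclease_5to3(strand, n):
    """Remove n nucleotides from the 5' end (5'->3' exonuclease)."""
    return strand[n:]

def exonuclease_3to5(strand, n):
    """Remove n nucleotides from the 3' end (3'->5' exonuclease)."""
    return strand[:max(len(strand) - n, 0)]

def endonuclease(strand, site, offset):
    """Cut internally at every occurrence of `site`, `offset` bases into it."""
    fragments, start = [], 0
    pos = strand.find(site)
    while pos != -1:
        cut = pos + offset
        fragments.append(strand[start:cut])
        start = cut
        pos = strand.find(site, pos + 1)
    fragments.append(strand[start:])   # remainder after the last cut
    return fragments

dna = "AAGAATTCTTGAATTCGG"
print(exonuclease_5to3(dna, 3))          # shortened from the 5' end
print(exonuclease_3to5(dna, 3))          # shortened from the 3' end
print(endonuclease(dna, "GAATTC", 1))    # internal, sequence-specific cuts
```

The exonuclease functions only ever shorten the strand from one end,
while the endonuclease leaves both original ends intact and produces
internal fragments.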

DNA Is Synthesized by DNA Polymerases 

The search for an enzyme that could synthesize DNA began in 1955.
Work by Arthur Kornberg and colleagues led to the purification and
characterization of DNA polymerase from E. coli cells, a
single-polypeptide enzyme now called DNA polymerase I (Mr 103,000;
encoded by the polA gene). Much later, investigators found that
E. coli contains at least four other distinct DNA polymerases,
described below.

Detailed studies of DNA polymerase I revealed features of the DNA
synthetic process that are now known to be common to all DNA
polymerases. The fundamental reaction is a phosphoryl group transfer.
The nucleophile is the 3'-hydroxyl group of the nucleotide at the 3'
end of the growing strand. Nucleophilic attack occurs at the α
phosphorus of the incoming deoxynucleoside 5'-triphosphate
(Fig. 25-5). Inorganic pyrophosphate is released in the reaction. The
general reaction is

$$\underbrace{(\mathrm{dNMP})_n}_{\text{DNA}} + \mathrm{dNTP} \longrightarrow \underbrace{(\mathrm{dNMP})_{n+1}}_{\text{Lengthened DNA}} + \mathrm{PP_i} \tag{25-1}$$

where dNMP and dNTP are deoxynucleoside 5'-monophosphate and
5'-triphosphate, respectively. The reaction appears to proceed with
only a minimal change in free energy, given that one phosphodiester
bond is formed at the expense of a somewhat less stable phosphate
anhydride. However, noncovalent base-stacking and base-pairing
interactions provide additional stabilization to the lengthened DNA
product relative to the free nucleotide. Also, the formation of
products is facilitated in the cell by the 19 kJ/mol generated in the
subsequent hydrolysis of the pyrophosphate product by the enzyme
pyrophosphatase.
FIGURE 25-5 Elongation of a DNA chain. (a) DNA polymerase I activity
requires a single unpaired strand to act as template and a primer
strand to provide a free hydroxyl group at the 3' end, to which a new
nucleotide unit is added. Each incoming nucleotide is selected in
part by base pairing to the appropriate nucleotide in the template
strand. The reaction product has a new free 3' hydroxyl, allowing the
addition of another nucleotide. (b) The catalytic mechanism likely
involves two Mg²⁺ ions, coordinated to the phosphate groups of the
incoming nucleotide triphosphate and to three Asp residues, two of
which are highly conserved in all DNA polymerases. The top Mg²⁺ ion
in the figure facilitates attack of the 3'-hydroxyl group of the
primer on the α phosphate of the nucleotide triphosphate; the lower
Mg²⁺ ion facilitates displacement of the pyrophosphate. Both ions
stabilize the structure of the pentacovalent transition state. RNA
polymerases use a similar mechanism (see Fig. 26-1b).
Early work on DNA polymerase I led to the definition of two central
requirements for DNA polymerization. First, all DNA polymerases
require a template. The polymerization reaction is guided by a
template DNA strand according to the base-pairing rules predicted by
Watson and Crick: where a guanine is present in the template, a
cytosine deoxynucleotide is added to the new strand, and so on. This
was a particularly important discovery, not only because it provided
a chemical basis for accurate semiconservative DNA replication but
also because it represented the first example of the use of a
template to guide a biosynthetic reaction.

Second, the polymerases require a primer. A primer 
is a strand segment (complementary to the template) 
with a free 3'-hydroxyl group to which a nucleotide can 
be added; the free 3' end of the primer is called the 
primer terminus. In other words, part of the new 
strand must already be in place: all DNA polymerases 
can only add nucleotides to a preexisting strand. Most 
primers are oligonucleotides of RNA rather than DNA, 
and specialized enzymes synthesize primers when and 
where they are required. 
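
The two requirements, a template and a primer, can be illustrated
with a small sketch. Everything named here is an assumption made for
illustration: the template is written 3'→5' so that it lines up
position by position with the primer written 5'→3', and a DNA primer
is used for simplicity even though most natural primers are RNA.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def extend_primer(template_3to5, primer_5to3):
    """Extend a base-paired primer to the end of the template."""
    if not primer_5to3:
        # No free 3'-hydroxyl is available, so nothing can be added.
        raise ValueError("a primer with a free 3' end is required")
    for t, p in zip(template_3to5, primer_5to3):
        if COMPLEMENT[t] != p:
            raise ValueError("primer is not complementary to the template")
    product = list(primer_5to3)
    # Each added nucleotide is chosen by pairing with the template base
    # and joined to the current 3' end (the primer terminus).
    for t in template_3to5[len(primer_5to3):]:
        product.append(COMPLEMENT[t])
    return "".join(product)

print(extend_primer("TACGGATC", "ATG"))
```

The function refuses to start from an empty primer, mirroring the
statement that polymerases can only add nucleotides to a preexisting
strand.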

After adding a nucleotide to a growing DNA strand, a DNA polymerase
either dissociates or moves along the template and adds another
nucleotide. Dissociation and reassociation of the polymerase can
limit the overall polymerization rate: the process is generally
faster when a polymerase adds more nucleotides without dissociating
from the template. The average number of nucleotides added before a
polymerase dissociates defines its processivity. DNA polymerases vary
greatly in processivity; some add just a few nucleotides before
dissociating, others add many thousands.
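
One simple way to think about processivity is sketched below. The
model is an assumption, not something stated in the text: after each
nucleotide is added the polymerase dissociates with a fixed
probability `p_dissociate`, so the number added per binding event is
geometrically distributed with mean 1/p. The function names and the
fixed random seed are likewise illustrative.

```python
import random

def nucleotides_before_dissociation(p_dissociate, rng):
    """Simulate one binding event; at least one nucleotide is added."""
    added = 1
    while rng.random() >= p_dissociate:   # polymerase stays on the template
        added += 1
    return added

def mean_processivity(p_dissociate, trials=20000, seed=0):
    """Average nucleotides added per binding event over many trials."""
    rng = random.Random(seed)
    total = sum(nucleotides_before_dissociation(p_dissociate, rng)
                for _ in range(trials))
    return total / trials

for p in (0.2, 0.01):
    print(p, round(mean_processivity(p), 1), "expected about", 1 / p)
```

In this model, lowering the per-step dissociation probability raises
processivity in inverse proportion, which is one way to picture why
an enzyme that rarely lets go of the template can add many thousands
of nucleotides per binding event.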

Replication Is Very Accurate 

Replication proceeds with an extraordinary degree of fidelity. In
E. coli, a mistake is made only once for every 10⁹ to 10¹⁰
nucleotides added. For the E. coli chromosome of ~4.6 × 10⁶ bp, this
means that an error occurs only once per 1,000 to 10,000
replications. During polymerization, discrimination between correct
and incorrect nucleotides relies not just on the hydrogen bonds that
specify the correct pairing between complementary bases but also on
the common geometry of the standard A=T and G≡C base pairs
(Fig. 25-6). The active site of DNA polymerase I accommodates only
base pairs with this geometry. An incorrect nucleotide may be able to
hydrogen-bond with a base in the template, but it generally will not
fit into the active site. Incorrect bases can be rejected before the
phosphodiester bond is formed.

FIGURE 25-6 Contribution of base-pair geometry to the fidelity of DNA
replication. (a) The standard A=T and G≡C base pairs have very
similar geometries, and an active site sized to fit one (blue
shading) will generally accommodate the other. (b) The geometry of
incorrectly paired bases can exclude them from the active site, as
occurs on DNA polymerase I.

The accuracy of the polymerization reaction itself, however, is
insufficient to account for the high degree of fidelity in
replication. Careful measurements in vitro have shown that DNA
polymerases insert one incorrect nucleotide for every 10⁴ to 10⁵
correct ones. These mistakes sometimes occur because a base is
briefly in an unusual tautomeric form (see Fig. 8-9), allowing it to
hydrogen-bond with an incorrect partner. In vivo, the error rate is
reduced by additional enzymatic mechanisms.

One mechanism intrinsic to virtually all DNA polymerases is a
separate 3'→5' exonuclease activity that double-checks each
nucleotide after it is added. This nuclease activity permits the
enzyme to remove a newly added nucleotide and is highly specific for
mismatched base pairs (Fig. 25-7). If the polymerase has added the
wrong nucleotide, translocation of the enzyme to the position where
the next nucleotide is to be added is inhibited. This kinetic pause
provides the opportunity for a correction. The 3'→5' exonuclease
activity removes the mispaired nucleotide, and the polymerase begins
again. This activity, known as proofreading, is not simply the
reverse of the polymerization reaction (Eqn 25-1), because
pyrophosphate is not involved. The polymerizing and proofreading
activities of a DNA polymerase can be measured separately.
Proofreading improves the inherent accuracy of the polymerization
reaction 10²- to 10³-fold. In the monomeric DNA polymerase I, the
polymerizing and proofreading activities have separate active sites
within the same polypeptide.

When base selection and proofreading are combined, DNA polymerase
leaves behind one net error for every 10⁶ to 10⁸ bases added. Yet the
measured accuracy of replication in E. coli is higher still. The
additional accuracy is provided by a separate enzyme system that
repairs the mismatched base pairs remaining after replication. We
describe this mismatch repair, along with other DNA repair processes,
in Section 25.2.
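
The way the quoted error frequencies combine can be checked with a
few lines of arithmetic. This is a sketch under one stated
assumption: the ends of each quoted range are paired with one another
(least accurate with least improved, and so on), which the text does
not spell out. Error frequencies are handled as powers of ten to keep
the arithmetic exact, and all names are illustrative.

```python
# log10 of error frequency, e.g. -4 means one error per 10^4 nucleotides.
BASE_SELECTION = (-4, -5)       # polymerization alone, measured in vitro
PROOFREADING_GAIN = (2, 3)      # 10^2- to 10^3-fold improvement
OBSERVED_IN_VIVO = (-9, -10)    # overall fidelity in E. coli

def apply_gain(error_log10, gain_log10):
    """Divide each error frequency by the matching fold improvement."""
    return tuple(e - g for e, g in zip(error_log10, gain_log10))

after_proofreading = apply_gain(BASE_SELECTION, PROOFREADING_GAIN)

# Whatever improvement remains must come from post-replication repair.
implied_repair_gain = tuple(a - o for a, o in
                            zip(after_proofreading, OBSERVED_IN_VIVO))

print("after proofreading: 10^%d to 10^%d" % after_proofreading)
print("implied further gain: 10^%d to 10^%d" % implied_repair_gain)
```

With this pairing, base selection plus proofreading gives one error
per 10⁶ to 10⁸ bases, matching the figure in the text, and the gap to
the observed in vivo fidelity corresponds to a further improvement of
roughly two to three orders of magnitude.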

E. coli Has at Least Five DNA Polymerases 

More than 90% of the DNA polymerase activity observed in E. coli
extracts can be accounted for by DNA polymerase I. Soon after the
isolation of this enzyme in 1955, however, evidence began to
accumulate that it is not suited for replication of the large E. coli
chromosome. First, the rate at which it adds nucleotides (600
nucleotides/min) is too slow (by a factor of 100 or more) to account
for the rates at which the replication fork moves in the bacterial
cell. Second, DNA polymerase I has a relatively low processivity.
Third, genetic studies have demonstrated that many genes, and
therefore many proteins, are involved in replication: DNA polymerase
I clearly does not act alone. Fourth, and most important, in 1969
John Cairns isolated a bacterial strain with an altered gene for DNA
polymerase I that produced an inactive enzyme. Although this strain
was abnormally sensitive to agents that damaged DNA, it was
nevertheless viable!
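
The first objection can be put in numbers with a rough estimate. The
sketch assumes two replication forks moving at a constant rate around
the ~4.6 × 10⁶ bp chromosome, with fork speed set by a single
polymerase; these simplifications, the function name, and the
1,000 nucleotides/s comparison value (the upper end of the range
listed for DNA polymerase III in Table 25-1) are choices made here
for illustration.

```python
CHROMOSOME_BP = 4.6e6   # approximate size of the E. coli chromosome

def replication_time_minutes(rate_nt_per_s, forks=2):
    """Minutes to copy the chromosome if each fork moves at the given rate."""
    bp_per_fork = CHROMOSOME_BP / forks
    return bp_per_fork / rate_nt_per_s / 60

pol_i_rate = 600 / 60    # 600 nucleotides/min expressed per second
pol_iii_rate = 1000      # nucleotides/s, upper value from Table 25-1

print(round(replication_time_minutes(pol_i_rate)), "min at the Pol I rate")
print(round(replication_time_minutes(pol_iii_rate), 1), "min at the Pol III rate")
```

At 10 nucleotides/s the estimate comes to several thousand minutes,
which is days rather than the tens of minutes obtained at
1,000 nucleotides/s, consistent with the hundredfold shortfall noted
above.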

A search for other DNA polymerases led to the discovery of E. coli
DNA polymerase II and DNA polymerase III in the early 1970s. DNA
polymerase II is an enzyme involved in one type of DNA repair
(Section 25.3). DNA polymerase III is the principal replication
enzyme in E. coli. The properties of these three DNA polymerases are
compared in Table 25-1. DNA polymerases IV and V, identified in 1999,
are involved in an unusual form of DNA repair (Section 25.2).

FIGURE 25-7 An example of error correction by the 3'→5' exonuclease
activity of DNA polymerase I. Structural analysis has located the
exonuclease activity ahead of the polymerase activity as the enzyme
is oriented in its movement along the DNA. A mismatched base (here, a
C-A mismatch) impedes translocation of DNA polymerase I to the next
site. Sliding backward, the enzyme corrects the mistake with its
3'→5' exonuclease activity, then resumes its polymerase activity in
the 5'→3' direction.

TABLE 25-1 Comparison of DNA Polymerases of E. coli

                                                DNA polymerase
                                        I          II         III
Structural gene*                        polA       polB       polC (dnaE)
Subunits (number of different types)    1          7          ≥10
Mr                                      103,000    88,000†    791,500
3'→5' Exonuclease (proofreading)        Yes        Yes        Yes
5'→3' Exonuclease                       Yes        No         No
Polymerization rate (nucleotides/s)     16-20      40         250-1,000
Processivity (nucleotides added
  before polymerase dissociates)        3-200      1,500      >500,000

*For enzymes with more than one subunit, the gene listed here encodes
the subunit with polymerization activity. Note that dnaE is an
earlier designation for the gene now referred to as polC.

†Polymerization subunit only. DNA polymerase II shares several
subunits with DNA polymerase III, including the β, γ, δ, δ', χ, and ψ
subunits (see Table 25-2).

DNA polymerase I, then, is not the primary enzyme of replication;
instead it performs a host of clean-up functions during replication,
recombination, and repair. The polymerase’s special functions are
enhanced by its 5'→3' exonuclease activity. This activity, distinct
from the 3'→5' proofreading exonuclease (Fig. 25-7), is located in a
structural domain that can be separated from the enzyme by mild
protease treatment. When the 5'→3' exonuclease domain is removed, the
remaining fragment (Mr 68,000), the large fragment or Klenow fragment
(Fig. 25-8), retains the polymerization and proofreading activities.
The 5'→3' exonuclease activity of intact DNA polymerase I can replace
a segment of DNA (or RNA) paired to the template strand, in a process
known as nick translation (Fig. 25-9). Most other DNA polymerases
lack a 5'→3' exonuclease activity.

DNA polymerase III is much more complex than 
DNA polymerase I, having ten types of subunits (Table 
25-2). Its polymerization and proofreading activities re¬ 
side in its a and s (epsilon) subunits, respectively. The 
6 subunit associates with a and e to form a core poly¬ 
merase, which can polymerize DNA but with limited 
processivity. Two core polymerases can be linked by 


TABLE 25-2 Subunits of DNA Polymerase III of E. coli 


Number of 
subunits per 


Subunit 

holoenzyme 

M r of subunit 

Gene 

Function of subunit 


a 

2 

129,900 

polC (dnaE) 

Polymerization activity 


£ 

2 

27,500 

dnaQ (mutD) 

3'—>5' Proofreading exonuclease 

Core polymerase 

e 

2 

8,600 

holE 



T 

2 

71,100 

dnaX 

Stable template binding; 






core enzyme dimerization 

Clamp-loading (y) complex that 

y 

1 

47,500 

dnaX * 

Clamp loader 

loads p subunits on lagging 

8 

1 

38,700 

holA 

Clamp opener 

strand at each Okazaki fragment 

8' 

1 

36,900 

holB 

Clamp loader 


X 

1 

16,600 

hole 

Interaction with SSB 


+ 

1 

15,200 

holD 

Interaction with y and \ 


P 

4 

40,600 

dnaN 

DNA clamp required for 



optimal processivity 


*The y subunit is encoded by a portion of the gene for the t subunit, such that the amino-terminal 66% of the r subunit has 
the same amino acid sequence as the y subunit. The y subunit is generated by a translational frameshifting mechanism (see 
Box 27-1) that leads to premature translational termination. 
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and by the requirements for accuracy. The main classes 
of replication enzymes are considered here in terms of 
the problems they overcome. 

Access to the DNA strands that are to act as tem¬ 
plates requires separation of the two parent strands. 
This is generally accomplished by helicases, enzymes 
that move along the DNA and separate the strands, us¬ 
ing chemical energy from ATP. Strand separation cre¬ 
ates topological stress in the helical DNA structure (see 
Fig. 24-12), which is relieved by the action of topo- 
isomerases. The separated strands are stabilized by 
DNA-binding proteins. As noted earlier, before DNA 
polymerases can begin synthesizing DNA, primers must 
be present on the template—generally short segments 


FIGURE 25-8 Large (Klenow) fragment of DNA polymerase I. This
polymerase is widely distributed in bacteria. The Klenow fragment,
produced by proteolytic treatment of the polymerase, retains the
polymerization and proofreading activities of the enzyme. The Klenow
fragment shown here is from the thermophilic bacterium Bacillus
stearothermophilus (PDB ID 3BDP). The active site for addition of
nucleotides is deep in the crevice at the far end of the bound DNA. The
dark blue strand is the template.


another set of subunits, a clamp-loading complex, or γ
complex, consisting of five subunits of four different
types, τ2γδδ'. The core polymerases are linked through
the τ (tau) subunits. Two additional subunits, χ (chi) and
ψ (psi), are bound to the clamp-loading complex. The
entire assembly of 13 protein subunits (nine different
types) is called DNA polymerase III* (Fig. 25-10a).

DNA polymerase III* can polymerize DNA, but with
a much lower processivity than one would expect for
the organized replication of an entire chromosome. The
necessary increase in processivity is provided by the
addition of the β subunits, four of which complete the DNA
polymerase III holoenzyme. The β subunits associate in
pairs to form donut-shaped structures that encircle the
DNA and act like clamps (Fig. 25-10b). Each dimer
associates with a core subassembly of polymerase III* (one
dimeric clamp per core subassembly) and slides along
the DNA as replication proceeds. The β sliding clamp
prevents the dissociation of DNA polymerase III from
DNA, dramatically increasing processivity, to greater
than 500,000 (Table 25-1).
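
The subunit bookkeeping in this passage can be checked with a little
arithmetic. The sketch below is only an illustration: the copy numbers and
Mr values are transcribed from Table 25-2, while the dictionary layout and
the helper name tally are assumptions made for this example. It counts the
subunits of DNA polymerase III* (everything except β) and of the
holoenzyme, and sums their masses.

```python
# Subunit data transcribed from Table 25-2: name -> (copies per holoenzyme, Mr).
SUBUNITS = {
    "alpha": (2, 129_900),
    "epsilon": (2, 27_500),
    "theta": (2, 8_600),
    "tau": (2, 71_100),
    "gamma": (1, 47_500),
    "delta": (1, 38_700),
    "delta_prime": (1, 36_900),
    "chi": (1, 16_600),
    "psi": (1, 15_200),
    "beta": (4, 40_600),
}


def tally(names):
    """Return (total subunits, distinct subunit types, summed Mr)."""
    copies = sum(SUBUNITS[n][0] for n in names)
    mass = sum(SUBUNITS[n][0] * SUBUNITS[n][1] for n in names)
    return copies, len(names), mass


# DNA polymerase III* is the assembly without the beta clamps.
POL_III_STAR = [n for n in SUBUNITS if n != "beta"]
HOLOENZYME = list(SUBUNITS)

print(tally(POL_III_STAR))  # (13, 9, 629100)
print(tally(HOLOENZYME))    # (17, 10, 791500)
```

The first result reproduces the 13 subunits of nine types quoted for DNA
polymerase III*; the second gives 17 subunits and a summed Mr of 791,500,
the values listed for DNA polymerase III in Table 25-4.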

DNA Replication Requires Many Enzymes 
and Protein Factors 

Replication in E. coli requires not just a single DNA
polymerase but 20 or more different enzymes and proteins,
each performing a specific task. The entire complex
has been termed the DNA replicase system or
replisome. The enzymatic complexity of replication
reflects the constraints imposed by the structure of DNA


FIGURE 25-9 Nick translation. In this process, an RNA or DNA strand
paired to a DNA template is simultaneously degraded by the 5'→3'
exonuclease activity of DNA polymerase I and replaced by the
polymerase activity of the same enzyme. These activities have a role in
both DNA repair and the removal of RNA primers during replication
(both described later). The strand of nucleic acid to be removed
(either DNA or RNA) is shown in green, the replacement strand in red.
DNA synthesis begins at a nick (a broken phosphodiester bond,
leaving a free 3' hydroxyl and a free 5' phosphate). Polymerase I extends
the nontemplate DNA strand and moves the nick along the DNA, a
process called nick translation. A nick remains where DNA polymerase
I dissociates, and is later sealed by another enzyme.

FIGURE 25-10 DNA polymerase III. (a) Architecture of bacterial
DNA polymerase III. Two core domains, composed of subunits α, ε,
and θ, are linked by a five-subunit γ complex (also known as the
clamp-loading complex) with the composition τ2γδδ'. The γ and τ
subunits are encoded by the same gene. The γ subunit is a shortened
version of τ; the τ subunit thus contains a domain identical to γ, along
with an additional segment that interacts with the core polymerase.
The other two subunits of DNA polymerase III*, χ and ψ (not shown),
also bind to the γ complex. Two β clamps interact with the two-core
subassembly, each clamp a dimer of the β subunit. The complex
interacts with the DnaB helicase through the τ subunit. (b) Two β
subunits of E. coli polymerase III form a circular clamp that surrounds the
DNA. The clamp slides along the DNA molecule, increasing the
processivity of the polymerase III holoenzyme to greater than 500,000 by
preventing its dissociation from the DNA. The end-on view shows the
two β subunits as gray and light-blue ribbon structures surrounding a
space-filling model of DNA. In the side view, surface contour models
of the β subunits (gray) surround a stick representation of a DNA
double helix (light and dark blue) (derived from PDB ID 2POL).



of RNA synthesized by enzymes known as primases.
Ultimately, the RNA primers are removed and replaced
by DNA; in E. coli, this is one of the many functions of
DNA polymerase I. After an RNA primer is removed and
the gap is filled in with DNA, a nick remains in the DNA
backbone in the form of a broken phosphodiester bond.
These nicks are sealed by DNA ligases. All these
processes require coordination and regulation, an
interplay best characterized in the E. coli system.

Replication of the E. coli Chromosome 
Proceeds in Stages 

The synthesis of a DNA molecule can be divided into
three stages: initiation, elongation, and termination,
distinguished both by the reactions taking place and by
the enzymes required. As you will find here and in the
next two chapters, synthesis of the major
information-containing biological polymers (DNAs, RNAs, and
proteins) can be understood in terms of these same three
stages, with the stages of each pathway having unique
characteristics. The events described below reflect
information derived primarily from in vitro experiments
using purified E. coli proteins, although the principles
are highly conserved in all replication systems.

Initiation The E. coli replication origin, oriC, consists
of 245 bp; it bears DNA sequence elements that are
highly conserved among bacterial replication origins.
The general arrangement of the conserved sequences is
illustrated in Figure 25-11. The key sequences of
interest here are two series of short repeats: three repeats
of a 13 bp sequence and four repeats of a 9 bp sequence.

FIGURE 25-11 Arrangement of sequences in the E. coli replication
origin, oriC. [Diagram labels: tandem array of three 13 bp sequences,
consensus sequence GATCTNTTNTTTT; binding sites for DnaA protein,
four 9 bp sequences, consensus sequence TTATCCACA.] Although the
repeated sequences (shaded in color) are not identical, certain
nucleotides are particularly common in each position, forming a
consensus sequence. In positions where there is no consensus, N
represents any of the four nucleotides. The arrows indicate the
orientations of the nucleotide sequences.
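
To make the idea of a consensus sequence concrete, the following sketch
scans a DNA string for the two oriC consensus sequences of Figure 25-11
(GATCTNTTNTTTT and TTATCCACA), treating N as "any nucleotide". It is an
illustration only: the function name find_consensus, the max_mismatches
parameter, and the toy sequence are assumptions made for this example, and
the toy sequence is not the real oriC.

```python
def find_consensus(seq, consensus, max_mismatches=0):
    """Return 0-based start positions where seq matches the consensus.

    'N' in the consensus matches any nucleotide; up to max_mismatches
    of the remaining positions may differ from the consensus.
    """
    seq = seq.upper()
    consensus = consensus.upper()
    hits = []
    for start in range(len(seq) - len(consensus) + 1):
        window = seq[start:start + len(consensus)]
        mismatches = sum(
            1 for base, want in zip(window, consensus)
            if want != "N" and base != want
        )
        if mismatches <= max_mismatches:
            hits.append(start)
    return hits


# Hypothetical toy sequence: one 13 bp repeat and one 9 bp repeat
# separated by filler bases.
toy = "CC" + "GATCTATTCTTTT" + "GG" + "TTATCCACA"

print(find_consensus(toy, "GATCTNTTNTTTT"))  # [2]
print(find_consensus(toy, "TTATCCACA"))      # [17]
```

Because the natural repeats are similar rather than identical, a scan of
real sequence would probably need max_mismatches set above zero.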

At least nine different enzymes or proteins (summarized
in Table 25-3) participate in the initiation phase
of replication. They open the DNA helix at the origin
and establish a prepriming complex for subsequent
reactions. The crucial component in the initiation process
is the DnaA protein. A single complex of four to five
DnaA protein molecules binds to the four 9 bp repeats
in the origin (Fig. 25-12, step 1), then recognizes and
successively denatures the DNA in the region of the
three 13 bp repeats, which are rich in A=T pairs (step
2). This process requires ATP and the bacterial
histonelike protein HU. The DnaC protein then loads the
DnaB protein onto the unwound region. Two
ring-shaped hexamers of DnaB, one loaded onto each DNA
strand, act as helicases, unwinding the DNA
bidirectionally and creating two potential replication forks. If
the E. coli single-stranded DNA-binding protein (SSB)
and DNA gyrase (DNA topoisomerase II) are now added
in vitro, thousands of base pairs are rapidly unwound
by the DnaB helicase, proceeding out from the origin.
Many molecules of SSB bind cooperatively to
single-stranded DNA, stabilizing the separated strands and
preventing renaturation while gyrase relieves the
topological stress produced by the DnaB helicase. When
additional replication proteins are included in the in vitro
system, the DNA unwinding mediated by DnaB is
coupled to replication, as described below.

Initiation is the only phase of DNA replication that 
is known to be regulated, and it is regulated such that 
replication occurs only once in each cell cycle. The 
mechanism of regulation is not yet well understood, but 
genetic and biochemical studies have provided a few 
insights. 

The timing of replication initiation is affected by
DNA methylation and interactions with the bacterial
plasma membrane. The oriC DNA is methylated by the
Dam methylase (Table 25-3), which methylates the N6
position of adenine within the palindromic sequence
(5')GATC. (Dam is not a biochemical expletive; it stands
for DNA adenine methylation.) The oriC region of E. coli
is highly enriched in GATC sequences: it has 11 of them
in its 245 bp, whereas the average frequency of GATC in
the E. coli chromosome as a whole is 1 in 256 bp.
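
The enrichment described here is easy to quantify. A frequency of 1 in
256 bp is what one expects for any particular 4 bp sequence when the four
bases are equally likely and independent (4^4 = 256). The sketch below is
a minimal illustration under that assumption; the function names are
invented for this example.

```python
def count_motif(seq, motif="GATC"):
    """Count occurrences of motif in seq, including overlapping ones."""
    seq, motif = seq.upper(), motif.upper()
    return sum(
        1 for i in range(len(seq) - len(motif) + 1)
        if seq[i:i + len(motif)] == motif
    )


def expected_sites(length_bp, motif_len=4):
    """Expected count, ASSUMING equal and independent base frequencies."""
    return length_bp * 0.25 ** motif_len


def enrichment(observed, length_bp, motif_len=4):
    """Ratio of observed sites to the random expectation."""
    return observed / expected_sites(length_bp, motif_len)


print(1 / 0.25 ** 4)                  # 256.0 bp per expected GATC site
print(round(enrichment(11, 245), 1))  # 11.5
```

With 11 sites in 245 bp, oriC carries GATC at roughly 11 to 12 times the
density expected by chance.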


FIGURE 25-12 Model for initiation of replication at the E. coli
origin, oriC. (1) About 20 DnaA protein molecules, each with a bound
ATP, bind at the four 9 bp repeats. The DNA is wrapped around this
complex. (2) The three A=T-rich 13 bp repeats are denatured
sequentially. (3) Hexamers of the DnaB protein bind to each strand,
with the aid of DnaC protein. The DnaB helicase activity further
unwinds the DNA in preparation for priming and DNA synthesis.

TABLE 25-3 Proteins Required to Initiate Replication at the E. coli Origin

Protein                             Mr        Number of  Function
                                              subunits
DnaA protein                         52,000   1          Recognizes ori sequence; opens duplex at specific sites in origin
DnaB protein (helicase)             300,000   6*         Unwinds DNA
DnaC protein                         29,000   1          Required for DnaB binding at origin
HU                                   19,000   2          Histonelike protein; DNA-binding protein; stimulates initiation
Primase (DnaG protein)               60,000   1          Synthesizes RNA primers
Single-stranded DNA-binding
  protein (SSB)                      75,600   4*         Binds single-stranded DNA
RNA polymerase                      454,000   5          Facilitates DnaA activity
DNA gyrase (DNA topoisomerase II)   400,000   4          Relieves torsional strain generated by DNA unwinding
Dam methylase                        32,000   1          Methylates (5')GATC sequences at oriC

*Subunits in these cases are identical.


Immediately after replication, the DNA is
hemimethylated: the parent strands have methylated oriC
sequences but the newly synthesized strands do not. The
hemimethylated oriC sequences are now sequestered
for a period by interaction with the plasma membrane
(the mechanism is unknown). After a time, oriC is
released from the plasma membrane, and it must be fully
methylated by Dam methylase before it can again bind
DnaA. Regulation of initiation also involves the slow
hydrolysis of ATP by DnaA protein, which cycles the
protein between active (with bound ATP) and inactive (with
bound ADP) forms on a timescale of 20 to 40 minutes.

Elongation The elongation phase of replication includes
two distinct but related operations: leading strand
synthesis and lagging strand synthesis. Several enzymes at
the replication fork are important to the synthesis of both
strands. Parent DNA is first unwound by DNA helicases,
and the resulting topological stress is relieved by
topoisomerases. Each separated strand is then stabilized by
SSB. From this point, synthesis of leading and lagging
strands is sharply different.

FIGURE 25-13 Synthesis of Okazaki fragments. (a) At intervals,
primase synthesizes an RNA primer for a new Okazaki fragment. Note
that if we consider the two template strands as lying side by side,
lagging strand synthesis formally proceeds in the opposite direction
from fork movement. (b) Each primer is extended by DNA
polymerase III. (c) DNA synthesis continues until the fragment extends
as far as the primer of the previously added Okazaki fragment. A new
primer is synthesized near the replication fork to begin the process
again.

Leading strand synthesis, the more straightforward 
of the two, begins with the synthesis by primase (DnaG 
protein) of a short (10 to 60 nucleotide) RNA primer at 
the replication origin. Deoxyribonucleotides are added 
to this primer by DNA polymerase III. Leading strand 
synthesis then proceeds continuously, keeping pace 
with the unwinding of DNA at the replication fork. 

Lagging strand synthesis, as we have noted, is
accomplished in short Okazaki fragments. First, an RNA
primer is synthesized by primase and, as in leading
strand synthesis, DNA polymerase III binds to the RNA
primer and adds deoxyribonucleotides (Fig. 25-13). On
this level, the synthesis of each Okazaki fragment seems
straightforward, but the reality is quite complex. The
complexity lies in the coordination of leading and
lagging strand synthesis: both strands are produced by a
single asymmetric DNA polymerase III dimer, which is
accomplished by looping the DNA of the lagging strand
as shown in Figure 25-14, bringing together the two
points of polymerization.



FIGURE 25-14 DNA synthesis on the leading 
and lagging strands. Events at the replication fork 
are coordinated by a single DNA polymerase III 
dimer, in an integrated complex with DnaB 
helicase. This figure shows the replication 
process already underway (parts (a) through (e) 
are discussed in the text). The lagging strand is 
looped so that DNA synthesis proceeds steadily 
on both the leading and lagging strand templates 
at the same time. Red arrows indicate the 3' end 
of the two new strands and the direction of DNA 
synthesis. Black arrows show the direction of 
movement of the parent DNA through the 
complex. An Okazaki fragment is being 
synthesized on the lagging strand. 



(a) Continuous synthesis on the leading strand proceeds
as DNA is unwound by the DnaB helicase.

(b) DNA primase binds to DnaB, synthesizes
a new primer, then dissociates.

The synthesis of Okazaki fragments on the lagging
strand entails some elegant enzymatic choreography.
The DnaB helicase and DnaG primase constitute a
functional unit within the replication complex, the
primosome. DNA polymerase III uses one set of its core
subunits (the core polymerase) to synthesize the leading
strand continuously, while the other set of core subunits
cycles from one Okazaki fragment to the next on the
looped lagging strand. The DnaB helicase unwinds the
DNA at the replication fork (Fig. 25-14a) as it travels
along the lagging strand template in the 5'→3'
direction. DNA primase occasionally associates with DnaB
helicase and synthesizes a short RNA primer (Fig.
25-14b). A new β sliding clamp is then positioned at the
primer by the clamp-loading complex of DNA
polymerase III (Fig. 25-14c). When synthesis of an Okazaki
fragment has been completed, replication halts, and the
core subunits of DNA polymerase III dissociate from
their β sliding clamp (and from the completed Okazaki
fragment) and associate with the new clamp (Fig.
25-14d, e). This initiates synthesis of a new Okazaki
fragment. As noted earlier, the entire complex
responsible for coordinated DNA synthesis at a replication fork
is a replisome. The proteins acting at the replication
fork are summarized in Table 25-4.

The replisome promotes rapid DNA synthesis,
adding ~1,000 nucleotides/s to each strand (leading and
lagging). Once an Okazaki fragment has been
completed, its RNA primer is removed and replaced with
DNA by DNA polymerase I, and the remaining nick is
sealed by DNA ligase (Fig. 25-15).

DNA ligase catalyzes the formation of a
phosphodiester bond between a 3' hydroxyl at the end of one
DNA strand and a 5' phosphate at the end of another
strand. The phosphate must be activated by
adenylylation. DNA ligases isolated from viruses and eukaryotes
use ATP for this purpose. DNA ligases from bacteria are
unusual in that they generally use NAD+, a cofactor
that normally functions in hydride transfer reactions
(see Fig. 13-15), as the source of the AMP activating
group (Fig. 25-16). DNA ligase is another enzyme of
DNA metabolism that has become an important reagent
in recombinant DNA experiments (see Fig. 9-1).

FIGURE 25-15 Final steps in the synthesis of lagging strand
segments. RNA primers in the lagging strand are removed by the 5'→3'
exonuclease activity of DNA polymerase I and replaced with DNA by
the same enzyme. The remaining nick is sealed by DNA ligase. The
role of ATP or NAD+ is shown in Figure 25-16. [Diagram labels:
lagging strand; nick; DNA polymerase I (dNTPs, rNMPs); DNA ligase
(ATP (or NAD+), AMP + PPi (or NMN)).]

Termination Eventually, the two replication forks of the
circular E. coli chromosome meet at a terminus region
containing multiple copies of a 20 bp sequence called
Ter (for terminus) (Fig. 25-17a). The Ter sequences are
arranged on the chromosome to create a sort of trap
that a replication fork can enter but cannot leave. The
Ter sequences function as binding sites for a protein
called Tus (terminus utilization substance). The Tus-Ter
complex can arrest a replication fork from only one
direction. Only one Tus-Ter complex functions per
replication cycle, the complex first encountered by either

TABLE 25-4 Proteins at the E. coli Replication Fork

Protein                             Mr        Number of  Function
                                              subunits
SSB                                  75,600    4         Binding to single-stranded DNA
DnaB protein (helicase)             300,000    6         DNA unwinding; primosome constituent
Primase (DnaG protein)               60,000    1         RNA primer synthesis; primosome constituent
DNA polymerase III                  791,500   17         New strand elongation
DNA polymerase I                    103,000    1         Filling of gaps; excision of primers
DNA ligase                           74,000    1         Ligation
DNA gyrase (DNA topoisomerase II)   400,000    4         Supercoiling

Modified from Kornberg, A. (1982) Supplement to DNA Replication, Table S11-2, W. H. Freeman and Company, New York.

FIGURE 25-16 Mechanism of the DNA ligase reaction. In each of
the three steps, one phosphodiester bond is formed at the expense of
another. Steps (1) and (2) lead to activation of the 5' phosphate in
the nick. An AMP group is transferred first to a Lys residue on the
enzyme and then to the 5' phosphate in the nick. In step (3), the
3'-hydroxyl group attacks this phosphate and displaces AMP, producing a
phosphodiester bond to seal the nick. In the E. coli DNA ligase
reaction, AMP is derived from NAD+. The DNA ligases isolated from a
number of viral and eukaryotic sources use ATP rather than NAD+,
and they release pyrophosphate rather than nicotinamide
mononucleotide (NMN) in step (1).



replication fork. Given that opposing replication forks
generally halt when they collide, Ter sequences do not
seem essential, but they may prevent overreplication by
one replication fork in the event that the other is
delayed or halted by an encounter with DNA damage or
some other obstacle.

So, when either replication fork encounters a
functional Tus-Ter complex, it halts; the other fork halts
when it meets the first (arrested) fork. The final few
hundred base pairs of DNA between these large protein
complexes are then replicated (by an as yet unknown
mechanism), completing two topologically interlinked
(catenated) circular chromosomes (Fig. 25-17b). DNA
circles linked in this way are known as catenanes.
Separation of the catenated circles in E. coli requires
topoisomerase IV (a type II topoisomerase). The separated
chromosomes then segregate into daughter cells at cell
division. The terminal phase of replication of other
circular chromosomes, including many of the DNA viruses
that infect eukaryotic cells, is similar.
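
The fork-trap logic can be illustrated with a deliberately simple model.
The sketch below is not a description of the real Ter map: the chromosome
length, the trap positions, the fork speeds, and the function name
termination are all hypothetical values chosen for illustration. It treats
the chromosome as a circle, lets two forks leave the origin in opposite
directions at constant speeds, and gives each fork one polar trap site
that arrests only that fork.

```python
def termination(length_bp, v_cw, v_ccw, cw_trap, ccw_trap):
    """Toy model of where and when two forks from one origin finish.

    Positions are bp measured clockwise from the origin on a circle of
    length_bp. cw_trap is the first site that arrests the clockwise
    fork; ccw_trap is the first site that arrests the counterclockwise
    fork (requires ccw_trap < cw_trap). Speeds are in bp/s.
    Returns (event, meeting position, seconds after initiation).
    """
    if v_cw <= 0 or v_ccw <= 0:
        raise ValueError("fork speeds must be positive")
    if not 0 < ccw_trap < cw_trap < length_bp:
        raise ValueError("expected 0 < ccw_trap < cw_trap < length_bp")

    t_collide = length_bp / (v_cw + v_ccw)         # forks meet, no arrest
    t_cw_arrest = cw_trap / v_cw                   # clockwise fork reaches its trap
    t_ccw_arrest = (length_bp - ccw_trap) / v_ccw  # counterclockwise fork reaches its trap

    if t_collide <= min(t_cw_arrest, t_ccw_arrest):
        return "forks collide", v_cw * t_collide, t_collide
    if t_cw_arrest < t_ccw_arrest:
        # The clockwise fork waits at its trap until the other fork arrives.
        return "clockwise fork arrested", cw_trap, (length_bp - cw_trap) / v_ccw
    return "counterclockwise fork arrested", ccw_trap, ccw_trap / v_cw


# Hypothetical 4.6 Mbp circle with traps on either side of the midpoint.
print(termination(4_600_000, 1000, 1000, 2_500_000, 2_100_000))
print(termination(4_600_000, 1000, 500, 2_500_000, 2_100_000))
```

With equal speeds the forks simply collide at the midpoint and no trap is
used. When the counterclockwise fork is slowed to half speed, the clockwise
fork is held at its trap instead of replicating well past the terminus
region, which is the protective role suggested in the text.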

FIGURE 25-17 Termination of chromosome replication in
E. coli. (a) The Ter sequences are positioned on the
chromosome in two clusters with opposite orientations. (b) Replication
of the DNA separating the opposing replication forks leaves the
completed chromosomes joined as catenanes, or topologically
interlinked circles. The circles are not covalently linked, but
because they are interwound and each is covalently closed,
they cannot be separated except by the action of
topoisomerases. In E. coli, a type II topoisomerase known as DNA
topoisomerase IV plays the primary role in the separation of
catenated chromosomes, transiently breaking both DNA strands
of one chromosome and allowing the other chromosome to pass
through the break.

Bacterial Replication Is Organized in Membrane-Bound
Replication Factories

The replication of a circular bacterial chromosome is
highly organized. Once bidirectional replication is
initiated at the origin, the two replisomes do not travel away
from each other along the DNA. Instead, the replisomes
are linked together and tethered to one point on the
bacterial inner membrane, and the DNA substrate is fed
through this "replication factory" (Fig. 25-18a). The
tethering point is at the center of the elongated
bacterial cell. After initiation, each of the two newly
synthesized replication origins is partitioned into one half of
the cell, and continuing replication extrudes each new
chromosome into that half (Fig. 25-18b). The elaborate
spatial organization of the newly replicated
chromosomes is orchestrated and maintained by many proteins,
including bacterial homologs of the SMC proteins and
topoisomerases (Chapter 24). Once replication is
terminated, the cell divides, and the chromosomes
sequestered in the two halves of the original cell are
accurately partitioned into the daughter cells. When
replication commences in the daughter cells, the origin
of replication is sequestered in new replication factories
formed at a point on the membrane at the center of the
cell, and the entire process is repeated.

Replication in Eukaryotic Cells Is More Complex 

The DNA molecules in eukaryotic cells are considerably
larger than those in bacteria and are organized into
complex nucleoprotein structures (chromatin; p. 938). The
essential features of DNA replication are the same in
eukaryotes and prokaryotes, and many of the protein
complexes are functionally and structurally conserved.
However, some interesting variations on the general
principles discussed above promise new insights into the
regulation of replication and its link with the cell cycle.


Origins of replication, called autonomously
replicating sequences (ARS) or replicators, have been
identified and best studied in yeast. Yeast replicators
span ~150 bp and contain several essential conserved
sequences. About 400 replicators are distributed among
the 16 chromosomes in a haploid yeast genome.
Initiation of replication in all eukaryotes requires a
multisubunit protein, the origin recognition complex (ORC),
which binds to several sequences within the replicator.
ORC interacts with and is regulated by a number of
other proteins involved in control of the eukaryotic cell
cycle. Two other proteins, CDC6 (discovered in a screen
for genes affecting the cell division cycle) and CDT1
(Cdc10-dependent transcript 1), bind to ORC and
mediate the loading of a heterohexamer of
minichromosome maintenance proteins (MCM2 to MCM7). The
MCM complex is a ring-shaped replicative helicase,
analogous to the bacterial DnaB helicase. The CDC6 and
CDT1 proteins have a role comparable to that of the
bacterial DnaC protein, loading the MCM helicase onto
the DNA near the replication origin.

FIGURE 25-18 Chromosome partitioning in bacteria. (a) All
replication is carried out at a central replication factory that
includes two complete replication forks. (b) The two replicated
copies of the bacterial chromosome are extruded from the replication
factory into the two halves of the cell, possibly with each newly
synthesized origin bound separately to different points on the plasma
membrane. Sequestering the two chromosome copies in separate cell
halves facilitates their proper segregation at cell division.

The rate of replication fork movement in eukaryotes
(~50 nucleotides/s) is only one-twentieth that
observed in E. coli. At this rate, replication of an average
human chromosome proceeding from a single origin
would take more than 500 hours. Replication of human
chromosomes in fact proceeds bidirectionally from
many origins, spaced 30,000 to 300,000 bp apart.
Eukaryotic chromosomes are almost always much larger
than bacterial chromosomes, so multiple origins are
probably a universal feature in eukaryotic cells.
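
The arithmetic behind these figures can be sketched in a few lines. The
fork rates and origin spacings below come from the text; the chromosome
length of 1.5 x 10^8 bp is only a round illustrative assumption, and the
function name is invented for this example. The single-origin estimate
comes out at several hundred hours, and the exact figure depends on the
length assumed and on whether one or two forks are counted.

```python
def replication_time_s(length_bp, fork_rate_nt_per_s, forks=2):
    """Seconds for `forks` forks moving at a constant rate to copy length_bp."""
    return length_bp / (fork_rate_nt_per_s * forks)


E_COLI_RATE = 1000   # nucleotides/s per fork (from the text)
EUKARYOTE_RATE = 50  # nucleotides/s per fork (from the text)

print(EUKARYOTE_RATE / E_COLI_RATE)  # 0.05, i.e. one-twentieth

# DNA between two neighbouring origins is copied by two converging forks.
for spacing_bp in (30_000, 300_000):
    minutes = replication_time_s(spacing_bp, EUKARYOTE_RATE, forks=2) / 60
    print(spacing_bp, "bp between origins:", minutes, "min")

# ASSUMPTION: 150,000,000 bp is an illustrative chromosome length,
# not a value given in the text.
for forks in (1, 2):
    hours = replication_time_s(150_000_000, EUKARYOTE_RATE, forks) / 3600
    print(forks, "fork(s) from a single origin:", round(hours), "h")
```

Under these assumptions, the stretch between adjacent origins is copied in
5 to 50 minutes, compared with hundreds of hours for a whole chromosome
replicated from one origin.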

Like bacteria, eukaryotes have several types of
DNA polymerases. Some have been linked to
particular functions, such as the replication of mitochondrial
DNA. The replication of nuclear chromosomes involves
DNA polymerase α, in association with DNA
polymerase δ. DNA polymerase α is typically a
multisubunit enzyme with similar structure and properties in all
eukaryotic cells. One subunit has a primase activity, and
the largest subunit (Mr ~180,000) contains the
polymerization activity. However, this polymerase has no
proofreading 3'→5' exonuclease activity, making it
unsuitable for high-fidelity DNA replication. DNA
polymerase α is believed to function only in the synthesis
of short primers (containing either RNA or DNA) for
Okazaki fragments on the lagging strand. These primers
are then extended by the multisubunit DNA
polymerase δ. This enzyme is associated with and
stimulated by a protein called proliferating cell nuclear
antigen (PCNA; Mr 29,000), found in large amounts in the
nuclei of proliferating cells. The three-dimensional
structure of PCNA is remarkably similar to that of the
β subunit of E. coli DNA polymerase III (Fig. 25-10b),
although primary sequence homology is not evident.
PCNA has a function analogous to that of the β
subunit, forming a circular clamp that greatly enhances the
processivity of the polymerase. DNA polymerase δ has
a 3'→5' proofreading exonuclease activity and appears
to carry out both leading and lagging strand synthesis
in a complex comparable to the dimeric bacterial DNA
polymerase III.

Yet another polymerase, DNA polymerase ε,
replaces DNA polymerase δ in some situations, such as in
DNA repair. DNA polymerase ε may also function at the
replication fork, perhaps playing a role analogous to that
of the bacterial DNA polymerase I, removing the primers
of Okazaki fragments on the lagging strand.

Many DNA viruses encode their own DNA
polymerases, and some of these have become targets for
pharmaceuticals. For example, the DNA polymerase of
the herpes simplex virus is inhibited by acyclovir, a
compound developed by Gertrude Elion (p. 876). Acyclovir
consists of guanine attached to an incomplete ribose
ring. It is phosphorylated by a virally encoded
thymidine kinase; acyclovir binds to this viral enzyme with an
affinity 200-fold greater than its binding to the cellular
thymidine kinase. This ensures that phosphorylation
occurs mainly in virus-infected cells. Cellular kinases
convert the resulting acyclo-GMP to acyclo-GTP, which is
both an inhibitor and a substrate of DNA polymerases,
and which competitively inhibits the herpes DNA
polymerase more strongly than cellular DNA polymerases.
Because it lacks a 3' hydroxyl, acyclo-GTP also acts as
a chain terminator when incorporated into DNA. Thus
viral replication is inhibited at several steps.



Two other protein complexes also function in
eukaryotic DNA replication. RPA (replication protein A)
is a eukaryotic single-stranded DNA-binding protein,
equivalent in function to the E. coli SSB protein. RFC
(replication factor C) is a clamp loader for PCNA and
facilitates the assembly of active replication complexes.
The subunits of the RFC complex have significant
sequence similarity to the subunits of the bacterial
clamp-loading (γ) complex.

The termination of replication on linear eukaryotic
chromosomes involves the synthesis of special
structures called telomeres at the ends of each
chromosome, as discussed in the next chapter.

SUMMARY 25.1 DNA Replication 


■ Replication of DNA occurs with very high 
fidelity and at a designated time in the cell 
cycle. Replication is semiconservative, each 
strand acting as template for a new daughter 
strand. It is carried out in three identifiable 
phases: initiation, elongation, and termination. 
The reaction starts at the origin and usually 
proceeds bidirectionally. 

■ DNA is synthesized in the 5'→3' direction by
DNA polymerases. At the replication fork, the
leading strand is synthesized continuously in
the same direction as replication fork
movement; the lagging strand is synthesized
discontinuously as Okazaki fragments, which
are subsequently ligated.

■ The fidelity of DNA replication is maintained
by (1) base selection by the polymerase, (2) a
3'→5' proofreading exonuclease activity that is
part of most DNA polymerases, and (3) specific
repair systems for mismatches left behind after
replication.

■ Most cells have several DNA polymerases. In 
E. coli, DNA polymerase III is the primary 
replication enzyme. DNA polymerase I is 
responsible for special functions during 
replication, recombination, and repair. 

■ Replication of the E. coli chromosome involves 
many enzymes and protein factors organized in 
replication factories, in which template DNA is 
spooled through two replisomes tethered to the 
bacterial plasma membrane. 

■ Replication is similar in eukaryotic cells, but 
eukaryotic chromosomes have many replication 
origins. 


25.2 DNA Repair 

A cell generally has only one or two sets of genomic
DNA. Damaged proteins and RNA molecules can be
quickly replaced by using information encoded in the
DNA, but DNA molecules themselves are irreplaceable.
Maintaining the integrity of the information in DNA is a
cellular imperative, supported by an elaborate set of
DNA repair systems. DNA can become damaged by a
variety of processes, some spontaneous, others
catalyzed by environmental agents (Chapter 8).
Replication itself can very occasionally damage the information
content in DNA when errors introduce mismatched base
pairs (such as G paired with T).

The chemistry of DNA damage is diverse and
complex. The cellular response to this damage includes a
wide range of enzymatic systems that catalyze some of
the most interesting chemical transformations in DNA
metabolism. We first examine the effects of alterations
in DNA sequence and then consider specific repair
systems.

Mutations Are Linked to Cancer 

The best way to illustrate the importance of DNA repair 
is to consider the effects of unrepaired DNA damage 
(a lesion). The most serious outcome is a change in the 
base sequence of the DNA, which, if replicated and 
transmitted to future cell generations, becomes perma¬ 
nent. A permanent change in the nucleotide sequence 
of DNA is called a mutation. Mutations can involve the 
replacement of one base pair with another (substitution 
mutation) or the addition or deletion of one or more 
base pairs (insertion or deletion mutations). If the mutation
affects nonessential DNA or if it has a negligible

FIGURE 25-19 Ames test for carcinogens, based on their mutagenicity.
A strain of Salmonella typhimurium having a mutation that
inactivates an enzyme of the histidine biosynthetic pathway is plated
on a histidine-free medium. Few cells grow. (a) The few small colonies
of S. typhimurium that do grow on a histidine-free medium carry spontaneous
back-mutations that permit the histidine biosynthetic pathway
to operate. Three identical nutrient plates (b), (c), and (d) have been 
inoculated with an equal number of cells. Each plate then receives a 
disk of filter paper containing progressively lower concentrations of a 
mutagen. The mutagen greatly increases the rate of back-mutation and 
hence the number of colonies. The clear areas around the filter paper 
indicate where the concentration of mutagen is so high that it is lethal 
to the cells. As the mutagen diffuses away from the filter paper, it is 
diluted to sublethal concentrations that promote back-mutation. Mutagens
can be compared on the basis of their effect on mutation rate.
Because many compounds undergo a variety of chemical transformations
after entering a cell, compounds are sometimes tested for mutagenicity
after first incubating them with a liver extract. Some substances
have been found to be mutagenic only after this treatment.


effect on the function of a gene, it is known as a silent 
mutation. Rarely, a mutation confers some biological 
advantage. Most nonsilent mutations, however, are 
deleterious. 

In mammals there is a strong correlation between 
the accumulation of mutations and cancer. A simple test 
developed by Bruce Ames measures the potential of a 
given chemical compound to promote certain easily de¬ 
tected mutations in a specialized bacterial strain (Fig. 
25-19). Few of the chemicals that we encounter in daily 
life score as mutagens in this test. However, of the com¬ 
pounds known to be carcinogenic from extensive animal 
trials, more than 90% are also found to be mutagenic in 
the Ames test. Because of this strong correlation be¬ 
tween mutagenesis and carcinogenesis, the Ames test 
for bacterial mutagens is widely used as a rapid and in¬ 
expensive screen for potential human carcinogens. 


The genome of a typical mammalian cell accumulates 
many thousands of lesions during a 24-hour period. 
However, as a result of DNA repair, fewer than 1 in 1,000 
becomes a mutation. DNA is a relatively stable mole¬ 
cule, but in the absence of repair systems, the cumula¬ 
tive effect of many infrequent but damaging reactions 
would make life impossible. 

All Cells Have Multiple DNA Repair Systems 

The number and diversity of repair systems reflect both 
the importance of DNA repair to cell survival and the 
diverse sources of DNA damage (Table 25-5). Some 
common types of lesions, such as pyrimidine dimers 
(see Fig. 8-34), can be repaired by several distinct sys¬ 
tems. Many DNA repair processes also appear to be ex¬ 
traordinarily inefficient energetically—an exception to 


TABLE 25-5 Types of DNA Repair Systems in E. coli

Enzymes/proteins                              Type of damage

Mismatch repair
  Dam methylase                               Mismatches
  MutH, MutL, MutS proteins
  DNA helicase II
  SSB
  DNA polymerase III
  Exonuclease I
  Exonuclease VII
  RecJ nuclease
  Exonuclease X
  DNA ligase

Base-excision repair
  DNA glycosylases                            Abnormal bases (uracil,
                                              hypoxanthine, xanthine);
                                              alkylated bases; in some
                                              other organisms,
                                              pyrimidine dimers
  AP endonucleases
  DNA polymerase I
  DNA ligase

Nucleotide-excision repair
  ABC excinuclease                            DNA lesions that cause
                                              large structural changes
                                              (e.g., pyrimidine dimers)
  DNA polymerase I
  DNA ligase

Direct repair
  DNA photolyases                             Pyrimidine dimers
  O6-Methylguanine-DNA methyltransferase      O6-Methylguanine
  AlkB protein                                1-Methylguanine,
                                              3-methylcytosine

the pattern observed in the metabolic pathways, where
every ATP is generally accounted for and used optimally. 
When the integrity of the genetic information is at stake, 
the amount of chemical energy invested in a repair 
process seems almost irrelevant. 

DNA repair is possible largely because the DNA molecule
consists of two complementary strands. DNA
damage in one strand can be removed and accurately
replaced by using the undamaged complementary strand
as a template. We consider here the principal types of
repair systems, beginning with those that repair the 
rare nucleotide mismatches that are left behind by repli¬ 
cation. 

Mismatch Repair Correction of the rare mismatches left 
after replication in E. coli improves the overall fidelity 
of replication by an additional factor of 10^2 to 10^3. The
mismatches are nearly always corrected to reflect the 
information in the old (template) strand, so the repair 
system must somehow discriminate between the tem¬ 
plate and the newly synthesized strand. The cell ac¬ 
complishes this by tagging the template DNA with 
methyl groups to distinguish it from newly synthesized 
strands. The mismatch repair system of E. coli includes 
at least 12 protein components (Table 25-5) that func¬ 
tion either in strand discrimination or in the repair 
process itself. 

The strand discrimination mechanism has not been 
worked out for most bacteria or eukaryotes, but is well 
understood for E. coli and some closely related bacte¬ 
ria. In these prokaryotes, strand discrimination is based 
on the action of Dam methylase (Table 25-3), which, as 
you will recall, methylates DNA at the N6 position of all
adenines within (5')GATC sequences. Immediately after
passage of the replication fork, there is a short period
(a few seconds or minutes) during which the template
strand is methylated but the newly synthesized
strand is not (Fig. 25-20). The transient unmethylated 
state of GATC sequences in the newly synthesized 
strand permits the new strand to be distinguished from 
the template strand. Replication mismatches in the 
vicinity of a hemimethylated GATC sequence are then 
repaired according to the information in the methylated 
parent (template) strand. Tests in vitro show that if both 
strands are methylated at a GATC sequence, few mis¬ 
matches are repaired; if neither strand is methylated, 
repair occurs but does not favor either strand. The cell’s 


FIGURE 25-20 Methylation and mismatch repair. Methylation of 
DNA strands can serve to distinguish parent (template) strands from 
newly synthesized strands in E. coli DNA, a function that is critical to 
mismatch repair (see Fig. 25-21). The methylation occurs at the N6 of
adenines in (5')GATC sequences. This sequence is a palindrome (see
Fig. 8-20), present in opposite orientations on the two strands.

methyl-directed mismatch repair system efficiently repairs
mismatches up to 1,000 bp from a hemimethylated
GATC sequence. For many bacterial species, the mechanism
of strand discrimination during mismatch repair
has not been determined. 
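
The E. coli strand-discrimination rules described above can be summarized in a short Python sketch. It is only an illustration under simplifying assumptions: the function names, the string labels, and the representation of methylation as two booleans are invented here, and the 1,000 bp figure is treated as a hard cutoff although the text gives it only as the range of efficient repair.

```python
# Illustrative sketch only: names and data layout are assumptions, not from the text.
MAX_DISTANCE_BP = 1000  # efficient repair reaches about 1,000 bp from a hemimethylated GATC


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq))


def find_gatc_sites(seq):
    """Return the 0-based start position of every GATC in seq."""
    sites = []
    start = seq.find("GATC")
    while start != -1:
        sites.append(start)
        start = seq.find("GATC", start + 1)
    return sites


def strand_to_repair(template_methylated, new_methylated):
    """Apply the in vitro observations for a single GATC site."""
    if template_methylated and not new_methylated:
        return "repair new strand"      # hemimethylated: the template is trusted
    if template_methylated and new_methylated:
        return "little repair"          # both strands methylated
    if not template_methylated and not new_methylated:
        return "repair either strand"   # no strand is favored
    return "not described"              # case not covered in the text


def nearest_usable_site(seq, mismatch_pos):
    """Return the closest GATC start within MAX_DISTANCE_BP of the mismatch, or None."""
    candidates = [s for s in find_gatc_sites(seq)
                  if abs(s - mismatch_pos) <= MAX_DISTANCE_BP]
    if not candidates:
        return None
    return min(candidates, key=lambda s: abs(s - mismatch_pos))
```

Because `reverse_complement("GATC")` returns `"GATC"`, the sketch also shows why the site is a palindrome: the same sequence is read on both strands, so each strand carries its own adenine that can be methylated.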

How is the mismatch correction process directed
by relatively distant GATC sequences? A mechanism is
illustrated in Figure 25-21. MutL protein forms a complex
with MutS protein, and the complex binds to all
mismatched base pairs (except C-C). MutH protein
binds to MutL and to GATC sequences encountered by
the MutL-MutS complex. DNA on both sides of the mismatch
is threaded through the MutL-MutS complex, creating
a DNA loop; simultaneous movement of both legs
of the loop through the complex is equivalent to the
complex moving in both directions at once along the
DNA. MutH has a site-specific endonuclease activity that
is inactive until the complex encounters a hemimethylated
GATC sequence. At this site, MutH catalyzes cleavage
of the unmethylated strand on the 5' side of the G
in GATC, which marks the strand for repair. Further
steps in the pathway depend on where the mismatch is
located relative to this cleavage site (Fig. 25-22).

When the mismatch is on the 5' side of the cleavage
site, the unmethylated strand is unwound and degraded
in the 3'→5' direction from the cleavage site
through the mismatch, and this segment is replaced with
new DNA. This process requires the combined action of
DNA helicase II, SSB, exonuclease I or exonuclease X
(both of which degrade strands of DNA in the 3'→5'
direction), DNA polymerase III, and DNA ligase. The
pathway for repair of mismatches on the 3' side of the
cleavage site is similar, except that the exonuclease is
either exonuclease VII (which degrades single-stranded
DNA in the 5'→3' or 3'→5' direction) or RecJ nuclease
(which degrades single-stranded DNA in the 5'→3'
direction).
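
The choice of exonuclease described above amounts to a simple lookup on the position of the mismatch relative to the MutH cleavage site. The following Python sketch records that rule; the function name and the returned dictionary are hypothetical conveniences, not part of any established model.

```python
# Illustrative lookup only; the function name and return format are assumptions.
SHARED_FACTORS = ["DNA helicase II", "SSB", "DNA polymerase III", "DNA ligase"]


def excision_plan(mismatch_side):
    """Return the exonuclease choices for a mismatch lying on the given side
    ("5'" or "3'") of the MutH cleavage site, following the text above."""
    if mismatch_side == "5'":
        exonucleases = ["exonuclease I", "exonuclease X"]    # both degrade 3'->5'
    elif mismatch_side == "3'":
        exonucleases = ["exonuclease VII", "RecJ nuclease"]  # both can degrade 5'->3'
    else:
        raise ValueError("mismatch_side must be \"5'\" or \"3'\"")
    return {"exonucleases": exonucleases, "shared_factors": SHARED_FACTORS}
```

In both branches the helicase, SSB, polymerase, and ligase are the same; only the polarity of the exonuclease changes.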

Mismatch repair is a particularly expensive process 
for E. coli in terms of energy expended. The mismatch 
may be 1,000 bp or more from the GATC sequence. The 
degradation and replacement of a strand segment of this
length require an enormous investment in activated
deoxynucleotide precursors to repair a single mismatched
base. This again underscores the importance to the cell 
of genomic integrity. 

All eukaryotic cells have several proteins structurally
and functionally analogous to the bacterial MutS
and MutL (but not MutH) proteins. Alterations in human
genes encoding proteins of this type produce some
of the most common inherited cancer-susceptibility syndromes
(Box 25-1), further demonstrating the value to
the organism of DNA repair systems. The main MutS homologs
in most eukaryotes, from yeast to humans, are
MSH2 (MutS homolog 2), MSH3, and MSH6. Heterodimers
of MSH2 and MSH6 generally bind to single

FIGURE 25-21 A model for the early steps of methyl-directed mismatch
repair. The proteins involved in this process in E. coli have been
purified (see Table 25-5). Recognition of the sequence (5')GATC and 
of the mismatch are specialized functions of the MutH and MutS proteins,
respectively. The MutL protein forms a complex with MutS at
the mismatch. DNA is threaded through this complex such that the
complex moves simultaneously in both directions along the DNA until
it encounters a MutH protein bound at a hemimethylated GATC sequence.
MutH cleaves the unmethylated strand on the 5' side of the
G in this sequence. A complex consisting of DNA helicase II and one
of several exonucleases then degrades the unmethylated DNA strand
from that point toward the mismatch (see Fig. 25-22).

BOX 25-1 BIOCHEMISTRY IN MEDICINE


DNA Repair and Cancer 

Human cancer develops when certain genes that reg¬ 
ulate normal cell division (oncogenes and tumor 
suppressor genes; Chapter 12) fail to function, are ac¬ 
tivated at the wrong time, or are altered. As a conse¬ 
quence, cells may grow out of control and form a 
tumor. The genes controlling cell division can be dam¬ 
aged by spontaneous mutation or overridden by the 
invasion of a tumor virus (Chapter 26). Not surpris¬ 
ingly, alterations in DNA-repair genes that result in an 
increase in the rate of mutation can greatly increase 
an individual’s susceptibility to cancer. Defects in the 
genes encoding the proteins involved in nucleotide- 
excision repair, mismatch repair, recombinational re¬ 
pair, and error-prone translesion synthesis have all 
been linked to human cancers. Clearly, DNA repair can 
be a matter of life and death. 

Nucleotide-excision repair requires a larger num¬ 
ber of proteins in humans than in bacteria, although 
the overall pathways are very similar. Genetic defects 
that inactivate nucleotide-excision repair have been 
associated with several genetic diseases, the best- 
studied of which is xeroderma pigmentosum, or XP. 
Because nucleotide-excision repair is the sole repair 
pathway for pyrimidine dimers in humans, people with 
XP are extremely light sensitive and readily develop 
sunlight-induced skin cancers. Most people with XP 
also have neurological abnormalities, presumably be¬ 
cause of their inability to repair certain lesions caused 
by the high rate of oxidative metabolism in neurons. 
Defects in the genes encoding any of at least seven 
different protein components of the nucleotide- 
excision repair system can result in XP, giving rise 
to seven different genetic groups denoted XPA to 
XPG. Several of these proteins (notably XPB, XPD, 
and XPG) also play roles in transcription-coupled 
base-excision repair of oxidative lesions, described in 
Chapter 26. 

Most microorganisms have redundant pathways 
for the repair of cyclobutane pyrimidine dimers— 
making use of DNA photolyase and sometimes base- 
excision repair as alternatives to nucleotide-excision 
repair—but humans and other placental mammals do 
not. This lack of a back-up to nucleotide-excision 
repair for the removal of pyrimidine dimers has led to 
speculation that early mammalian evolution involved 
small, furry, nocturnal animals with little need to repair
UV damage. However, mammals do have a pathway
for the translesion bypass of cyclobutane pyrimidine
dimers, which involves DNA polymerase η. This
enzyme preferentially inserts two A residues opposite
a T-T pyrimidine dimer, minimizing mutations. People
with a genetic condition in which DNA polymerase
η function is missing exhibit an XP-like illness known
as XP-variant or XP-V. Clinical manifestations of XP-V
are similar to those of the classic XP diseases, although
mutation levels are higher when cells are exposed
to UV light. Apparently, the nucleotide-excision
repair system works in concert with DNA polymerase
η in normal human cells, repairing and/or bypassing
pyrimidine dimers as needed to keep cell growth and
DNA replication going. Exposure to UV light introduces
a heavy load of pyrimidine dimers, requiring
that some be bypassed by translesion synthesis to
keep replication on track. When either system is missing,
it is partly compensated for by the other. A loss
of polymerase η activity leads to stalled replication
forks and bypass of UV lesions by different, and more
mutagenic, translesion synthesis (TLS) polymerases.
As when other DNA repair systems are absent, the 
resulting increase in mutations often leads to cancer. 

One of the most common inherited cancer-susceptibility
syndromes is hereditary nonpolyposis colon
cancer, or HNPCC. This syndrome has been traced to
defects in mismatch repair. Human and other eukaryotic
cells have several proteins analogous to the bacterial
MutL and MutS proteins (see Fig. 25-21). Defects
in at least five different mismatch repair genes
can give rise to HNPCC. The most prevalent are defects
in the hMLH1 (human MutL homolog 1) and
hMSH2 (human MutS homolog 2) genes. In individuals
with HNPCC, cancer generally develops at an early
age, with colon cancers being most common.

Most human breast cancer occurs in women with
no known predisposition. However, about 10% of cases
are associated with inherited defects in two genes,
BRCA1 and BRCA2. BRCA1 and BRCA2 are large proteins
(human BRCA1 and BRCA2 are 1834 and 3418
amino acid residues long, respectively). They both interact
with a wide range of other proteins involved in
transcription, chromosome maintenance, DNA repair,
and control of the cell cycle. However, the precise
molecular function of BRCA1 and BRCA2 in these
various cellular processes is not yet clear. Women with
defects in either the BRCA1 or BRCA2 gene have a
greater than 80% chance of developing breast cancer.

FIGURE 25-22 Completing methyl-directed mismatch repair. The
combined action of DNA helicase II, SSB, and one of four different
exonucleases removes a segment of the new strand between the MutH
cleavage site and a point just beyond the mismatch. The exonuclease
that is used depends on the location of the cleavage site relative to
the mismatch. The resulting gap is filled in by DNA polymerase III,
and the nick is sealed by DNA ligase (not shown).



base-pair mismatches, and bind less well to slightly
longer mispaired loops. In many organisms the longer
mismatches (2 to 6 bp) may be bound instead by a
heterodimer of MSH2 and MSH3, or are bound by both
types of heterodimers in tandem. Homologs of MutL,
predominantly a heterodimer of MLH1 and PMS1
(postmeiotic segregation), bind to and stabilize the MSH
complexes. Many details of the subsequent events in
eukaryotic mismatch repair remain to be worked out. In
particular, we do not know the mechanism by which 
newly synthesized DNA strands are identified, although 
research has revealed that this strand identification 
does not involve GATC sequences. 

Base-Excision Repair Every cell has a class of enzymes 
called DNA glycosylases that recognize particularly 
common DNA lesions (such as the products of cytosine 
and adenine deamination; see Fig. 8-33a) and remove 
the affected base by cleaving the N-glycosyl bond. This
cleavage creates an apurinic or apyrimidinic site in the 
DNA, commonly referred to as an AP site or abasic 


site. Each DNA glycosylase is generally specific for one 
type of lesion. 

Uracil DNA glycosylases, for example, found in most
cells, specifically remove from DNA the uracil that results
from spontaneous deamination of cytosine. Mutant
cells that lack this enzyme have a high rate of G≡C to
A=T mutations. This glycosylase does not remove uracil
residues from RNA or thymine residues from DNA. The
capacity to distinguish thymine from uracil, the product
of cytosine deamination (necessary for the selective repair
of the latter), may be one reason why DNA evolved
to contain thymine instead of uracil (p. 293).

Bacteria generally have just one type of uracil DNA
glycosylase, whereas humans have at least four types,
with different specificities, an indicator of the importance
of uracil removal from DNA. The most abundant
human uracil glycosylase, UNG, is associated with the
human replisome, where it eliminates the occasional U
residue inserted in place of a T during replication. The
deamination of C residues is 100-fold faster in single-stranded
DNA than in double-stranded DNA, and

FIGURE 25-23 DNA repair by the base-excision repair pathway.
(1) A DNA glycosylase recognizes a damaged base and cleaves
between the base and deoxyribose in the backbone. (2) An AP
endonuclease cleaves the phosphodiester backbone near the AP site.
(3) DNA polymerase I initiates repair synthesis from the free 3'
hydroxyl at the nick, removing (with its 5'→3' exonuclease activity) a
portion of the damaged strand and replacing it with undamaged DNA.
(4) The nick remaining after DNA polymerase I has dissociated is
sealed by DNA ligase.

humans have the enzyme hSMUG1, which removes any
U residues that occur in single-stranded DNA during 
replication or transcription. Two other human DNA 
glycosylases, TDG and MBD4, remove either U or T 
residues paired with G, generated by deamination of 
cytosine or 5-methylcytosine, respectively. 

Other DNA glycosylases recognize and remove a variety
of damaged bases, including formamidopyrimidine
and 8-hydroxyguanine (both arising from purine oxidation),
hypoxanthine (arising from adenine deamination),
and alkylated bases such as 3-methyladenine and
7-methylguanine. Glycosylases that recognize other lesions,
including pyrimidine dimers, have also been identified
in some classes of organisms. Remember that AP
sites also arise from the slow, spontaneous hydrolysis of
the N-glycosyl bonds in DNA (see Fig. 8-33b).

Once an AP site has formed, another group of enzymes
must repair it. The repair is not made by simply
inserting a new base and re-forming the N-glycosyl
bond. Instead, the deoxyribose 5'-phosphate left behind
is removed and replaced with a new nucleotide. This
process begins with AP endonucleases, enzymes that
cut the DNA strand containing the AP site. The position
of the incision relative to the AP site (5' or 3' to the
site) varies with the type of AP endonuclease. A segment
of DNA including the AP site is then removed,
DNA polymerase I replaces the DNA, and DNA ligase
seals the remaining nick (Fig. 25-23). In eukaryotes, nucleotide
replacement is carried out by specialized polymerases,
as described below.

Nucleotide-Excision Repair DNA lesions that cause large 
distortions in the helical structure of DNA generally are 
repaired by the nucleotide-excision system, a repair 
pathway critical to the survival of all free-living organ¬ 
isms. In nucleotide-excision repair (Fig. 25-24), a mul¬ 
tisubunit enzyme hydrolyzes two phosphodiester bonds, 
one on either side of the distortion caused by the lesion. 
In E. coli and other prokaryotes, the enzyme system hydrolyzes
the fifth phosphodiester bond on the 3' side
and the eighth phosphodiester bond on the 5' side to
generate a fragment of 12 to 13 nucleotides (depending
on whether the lesion involves one or two bases). In humans
and other eukaryotes, the enzyme system hydrolyzes
the sixth phosphodiester bond on the 3' side
and the twenty-second phosphodiester bond on the 5'
side, producing a fragment of 27 to 29 nucleotides. Following
the dual incision, the excised oligonucleotides
are released from the duplex and the resulting gap is
filled, by DNA polymerase I in E. coli and DNA polymerase
ε in humans. DNA ligase seals the nick.
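
The fragment sizes quoted above follow from counting the nucleotides between the two incisions. A minimal Python sketch of that count is given here, assuming (as a counting convention introduced for this example) that the first phosphodiester bond on each side is the one immediately adjacent to the lesion; the function and parameter names are hypothetical.

```python
def excised_fragment_length(bond_5prime, bond_3prime, lesion_bases):
    """Count the nucleotides released by a dual incision.

    bond_5prime  -- which phosphodiester bond is cut on the 5' side (E. coli: 8)
    bond_3prime  -- which phosphodiester bond is cut on the 3' side (E. coli: 5)
    lesion_bases -- number of bases involved in the lesion (1 or 2)

    Assumed convention: bond number 1 on either side is the bond directly
    next to the lesion, so cutting bond n leaves n - 1 undamaged nucleotides
    between the cut and the lesion.
    """
    return (bond_5prime - 1) + lesion_bases + (bond_3prime - 1)


# E. coli incision positions taken from the text above.
for bases in (1, 2):
    print(bases, "lesion base(s):", excised_fragment_length(8, 5, bases), "nucleotides")
```

With the E. coli positions this gives 12 and 13 nucleotides, matching the text. The same count with the human positions (6th and 22nd bonds) gives 27 or 28, which falls inside but does not span the quoted 27 to 29 range, so the convention should be read as an approximation there.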

In E. coli, the key enzymatic complex is the ABC
excinuclease, which has three subunits, UvrA (Mr
104,000), UvrB (Mr 78,000), and UvrC (Mr 68,000). The

FIGURE 25-24 Nucleotide-excision repair in E. coli and humans. The general pathway of
nucleotide-excision repair is similar in all organisms. (1) An excinuclease binds to DNA at the site
of a bulky lesion and cleaves the damaged DNA strand on either side of the lesion. (2) The DNA
segment, of 13 nucleotides (13 mer) or 29 nucleotides (29 mer), is removed with the aid of a
helicase. (3) The gap is filled in by DNA polymerase, and (4) the remaining nick is sealed with
DNA ligase.

term “excinuclease” is used to describe the unique capacity
of this enzyme complex to catalyze two specific
endonucleolytic cleavages, distinguishing this activity 
from that of standard endonucleases. A complex of the 
UvrA and UvrB proteins (A2B) scans the DNA and binds
to the site of a lesion. The UvrA dimer then dissociates, 
leaving a tight UvrB-DNA complex. UvrC protein then 
binds to UvrB, and UvrB makes an incision at the fifth 
phosphodiester bond on the 3' side of the lesion. This 
is followed by a UvrC-mediated incision at the eighth 
phosphodiester bond on the 5' side. The resulting 12 to 
13 nucleotide fragment is removed by UvrD helicase. 
The short gap thus created is filled in by DNA polymerase
I and DNA ligase. This pathway is a primary
repair route for many types of lesions, including cyclobutane
pyrimidine dimers, 6-4 photoproducts (see Fig.
8-34), and several other types of base adducts including
benzo[a]pyrene-guanine, which is formed in DNA by
exposure to cigarette smoke. The nucleolytic activity of
the ABC excinuclease is novel in the sense that two cuts 
are made in the DNA (Fig. 25-24). 

The mechanism of eukaryotic excinucleases is quite
similar to that of the bacterial enzyme, although 16 polypeptides
with no similarity to the E. coli excinuclease
subunits are required for the dual incision. As described
in Chapter 26, some of the nucleotide-excision repair
and base-excision repair in eukaryotes is closely tied
to transcription. Genetic deficiencies in nucleotide-excision
repair in humans give rise to a variety of serious
diseases (Box 25-1).


MECHANISM FIGURE 25-25 Repair of pyrimidine dimers with photolyase.
Energy derived from absorbed light is used to reverse the photoreaction
that caused the lesion. The two chromophores in E. coli
photolyase (Mr 54,000), N5,N10-methenyltetrahydrofolylpolyglutamate
(MTHFpolyGlu) and FADH−, perform complementary functions.
On binding of photolyase to a pyrimidine dimer, repair proceeds as
follows. (1) A blue-light photon (300 to 500 nm wavelength) is absorbed
by the MTHFpolyGlu, which functions as a photoantenna.
(2) The excitation energy passes to FADH− in the active site of the
enzyme. (3) The excited flavin (*FADH−) donates an electron to the
pyrimidine dimer (shown here in a simplified representation) to generate
an unstable dimer radical. (4) Electronic rearrangement restores
the monomeric pyrimidines, and (5) the electron is transferred back
to the flavin radical to regenerate FADH−.


Direct Repair Several types of damage are repaired
without removing a base or nucleotide. The best-characterized
example is direct photoreactivation of cyclobutane
pyrimidine dimers, a reaction promoted by
DNA photolyases. Pyrimidine dimers result from an
ultraviolet light-induced reaction, and photolyases use
energy derived from absorbed light to reverse the damage
(Fig. 25-25). Photolyases generally contain two
cofactors that serve as light-absorbing agents, or chromophores.
One of the chromophores is always FADH−.
In E. coli and yeast, the other chromophore is a folate.
The reaction mechanism entails the generation of free
radicals. DNA photolyases are not present in the cells
of placental mammals (which include humans).

FIGURE 25-26 Example of how DNA damage results in mutations.
(a) The methylation product O6-methylguanine pairs with thymine
rather than cytosine. (b) If not repaired, this leads to a G≡C to A=T
mutation after replication.
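
The two-step route from an O6-methylguanine lesion to a fixed G≡C to A=T mutation can be traced with a small Python sketch. This is a toy model: the label "mG" for O6-methylguanine, the pairing table, and the function name are assumptions made for illustration, and strand polarity is ignored (each copied base is simply stored at the same index as its template base).

```python
# Toy pairing table; "mG" stands for O6-methylguanine (a label invented here).
PAIRING = {"A": "T", "T": "A", "G": "C", "C": "G", "mG": "T"}


def copy_strand(template):
    """Return the base inserted opposite each template base, index by index."""
    return [PAIRING[base] for base in template]


original_partner = copy_strand(["G"])       # undamaged G templates C
first_round = copy_strand(["mG"])           # the lesion templates T instead of C
second_round = copy_strand(first_round)     # that T then templates A

print("undamaged pair:      G -", original_partner[0])
print("after replication 1: mG -", first_round[0])
print("after replication 2:", second_round[0], "-", first_round[0])
```

After the second round the position that began as a G≡C pair is occupied by an A=T pair, even though the lesion itself is no longer present in that duplex.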


Additional examples can be seen in the repair of
nucleotides with alkylation damage. The modified nucleotide
O6-methylguanine forms in the presence of
alkylating agents and is a common and highly mutagenic
lesion (p. 295). It tends to pair with thymine rather than
cytosine during replication, and therefore causes G≡C
to A=T mutations (Fig. 25-26). Direct repair of O6-methylguanine
is carried out by O6-methylguanine-DNA
methyltransferase, a protein that catalyzes transfer of
the methyl group of O6-methylguanine to one of its own
Cys residues. This methyltransferase is not strictly an
enzyme, because a single methyl transfer event permanently
methylates the protein, making it inactive in this
pathway. The consumption of an entire protein molecule
to correct a single damaged base is another vivid
illustration of the priority given to maintaining the integrity
of cellular DNA.

A very different but equally direct mechanism is
used to repair 1-methyladenine and 3-methylcytosine.
The amino groups of A and C residues are sometimes
methylated when the DNA is single-stranded, and the
methylation directly affects proper base pairing. In
E. coli, oxidative demethylation of these alkylated nucleotides
is mediated by the AlkB protein, a member of
the α-ketoglutarate-Fe2+-dependent dioxygenase superfamily
(Fig. 25-27). (See Box 4-3 for a description
of another member of this enzyme family.)

FIGURE 25-27 Direct repair of alkylated bases by AlkB. The AlkB
protein is an α-ketoglutarate-Fe2+-dependent dioxygenase. It catalyzes
the oxidative demethylation of 1-methyladenine and 3-methylcytosine
residues.






The Interaction of Replication Forks with DNA 
Damage Can Lead to Error-Prone Translesion 
DNA Synthesis 

The repair pathways considered to this point generally 
work only for lesions in double-stranded DNA, the un¬ 
damaged strand providing the correct genetic informa¬ 
tion to restore the damaged strand to its original state. 
However, in certain types of lesions, such as double-strand
breaks, double-strand cross-links, or lesions in a
single-stranded DNA, the complementary strand is itself
damaged or is absent. Double-strand breaks and lesions
in single-stranded DNA most often arise when a
replication fork encounters an unrepaired DNA lesion 
(Fig. 25-28). Such lesions and DNA cross-links can also 
result from ionizing radiation and oxidative reactions. 

At a stalled bacterial replication fork, there are two
avenues for repair. In the absence of a second strand,
the information required for accurate repair must come
from a separate, homologous chromosome. The repair
system thus involves homologous genetic recombination.
This recombinational DNA repair is considered
in detail in Section 25.3. Under some conditions, a second
repair pathway, error-prone translesion DNA
synthesis (often abbreviated TLS), becomes available.
When this pathway is active, DNA repair becomes significantly
less accurate and a high mutation rate can result.
In bacteria, error-prone translesion DNA synthesis
is part of a cellular stress response to extensive DNA
damage known, appropriately enough, as the SOS response.
Some SOS proteins, such as the UvrA and UvrB
proteins already described (Table 25-6), are normally
present in the cell but are induced to higher levels as
part of the SOS response. Additional SOS proteins participate
in the pathway for error-prone repair; these include
the UmuC and UmuD proteins (“Umu” from unmutable;
lack of the umu gene function eliminates
error-prone repair). The UmuD protein is cleaved in an
SOS-regulated process to a shorter form called UmuD',
which forms a complex with UmuC to create a specialized
DNA polymerase (DNA polymerase V) that can
replicate past many of the DNA lesions that would normally
block replication. Proper base pairing is often impossible
at the site of such a lesion, so this translesion
replication is error-prone.

Given the emphasis on the importance of genomic
integrity throughout this chapter, the existence of a system
that increases the rate of mutation may seem incongruous.
However, we can think of this system as a
desperation strategy. The umuC and umuD genes are
fully induced only late in the SOS response, and they
are not activated for translesion synthesis initiated by
UmuD cleavage unless the levels of DNA damage are
particularly high and all replication forks are blocked.
The mutations resulting from DNA polymerase V-mediated
replication kill some cells and create deleterious
mutations in others, but this is the biological price
an organism pays to overcome an otherwise insurmountable
barrier to replication, as it permits at least a
few mutant cells to survive.

In addition to DNA polymerase V, translesion replication
requires the RecA protein, SSB, and some subunits
derived from DNA polymerase III. Yet another DNA
polymerase, DNA polymerase IV, is also induced during

FIGURE 25-28 DNA damage and its effect on
DNA replication. If the replication fork encounters 
an unrepaired lesion or strand break, replication 
generally halts and the fork may collapse. A lesion 
is left behind in an unreplicated, single-stranded 
segment of the DNA; a strand break becomes a 
double-strand break. In each case, the damage to 
one strand cannot be repaired by mechanisms 
described earlier in this chapter, because the 
complementary strand required to direct accurate 
repair is damaged or absent. There are two 
possible avenues for repair: recombinational DNA 
repair (described in Fig. 25-37) or, when lesions 
are unusually numerous, error-prone repair. The 
latter mechanism involves a novel DNA poly¬ 
merase (DNA polymerase V, encoded by the 
umuC and umuD genes) that can replicate, albeit 
inaccurately, over many types of lesions. The 
repair mechanism is referred to as error-prone 
because mutations often result. 


the SOS response. Replication by DNA polymerase IV, 
a product of the dinB gene, is also highly error-prone. 
The bacterial DNA polymerases IV and V are part of a 
family of TLS polymerases found in all organisms. These 
enzymes lack a proofreading exonuclease activity, and 
the fidelity of replicative base selection can be reduced 


by a factor of 10 2 , lowering overall replication fidelity to 
one error in -1,000 nucleotides. 
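
As a rough illustration of what an error rate of about one in 1,000 nucleotides means in practice, the following Python sketch computes the expected number of misincorporations and the chance of at least one error over a stretch of a given length. It assumes that errors are independent and equally likely at every position, a simplification that the text does not state, and the function names are illustrative only.

```python
# Approximate TLS polymerase error rate quoted in the text:
# about one error per 1,000 nucleotides.
ERROR_RATE_TLS = 1e-3


def expected_errors(n_nucleotides, error_rate=ERROR_RATE_TLS):
    """Expected number of misincorporated nucleotides in a stretch."""
    return n_nucleotides * error_rate


def prob_at_least_one_error(n_nucleotides, error_rate=ERROR_RATE_TLS):
    """Probability that a stretch contains one or more errors,
    assuming independent errors at each position (an assumption)."""
    return 1.0 - (1.0 - error_rate) ** n_nucleotides


if __name__ == "__main__":
    for length in (1, 10, 100, 1000):
        print(length, expected_errors(length), prob_at_least_one_error(length))
```

Under these assumptions the risk of introducing a mutation grows with the length of DNA copied, which is one way to see why a cell would confine such a polymerase to short stretches around a lesion.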

Mammals have many low-fidelity DNA polymerases 
of the TLS polymerase family. However, the presence of 
these enzymes does not necessarily translate into an 
unacceptable mutational burden, because most of the 


TABLE 25-6 Genes Induced as Part of the SOS Response in E. coli

Gene name        Protein encoded and/or role in DNA repair

Genes of known function
  polB (dinA)    Encodes polymerization subunit of DNA polymerase II,
                 required for replication restart in recombinational
                 DNA repair
  uvrA, uvrB     Encode ABC excinuclease subunits UvrA and UvrB
  umuC, umuD     Encode DNA polymerase V
  sulA           Encodes protein that inhibits cell division, possibly
                 to allow time for DNA repair
  recA           Encodes RecA protein, required for error-prone repair
                 and recombinational repair
  dinB           Encodes DNA polymerase IV

Genes involved in DNA metabolism, but role in DNA repair unknown
  ssb            Encodes single-stranded DNA-binding protein (SSB)
  uvrD           Encodes DNA helicase II (DNA-unwinding protein)
  himA           Encodes subunit of integration host factor (IHF),
                 involved in site-specific recombination, replication,
                 transposition, regulation of gene expression
  recN           Required for recombinational repair

Genes of unknown function
  dinD
  dinF

Note: Some of these genes and their functions are further discussed in Chapter 28.



enzymes also have specialized functions in DNA repair. DNA polymerase η
(eta), for example, is a TLS polymerase found in all eukaryotes. It
promotes translesion synthesis primarily across cyclobutane T-T dimers.
Few mutations result in this case, because the enzyme preferentially
inserts two A residues across from the linked T residues. Several other
low-fidelity polymerases, including DNA polymerases β, ι (iota), and λ,
have specialized roles in eukaryotic base-excision repair. Each of these
enzymes has a 5'-deoxyribose phosphate lyase activity in addition to its
polymerase activity. After base removal by a glycosylase and backbone
cleavage by an AP endonuclease, these enzymes remove the abasic site
(a 5'-deoxyribose phosphate) and fill in the very short gap with their
polymerase activity. The frequency of mutations due to DNA polymerase η
activity is minimized by the very short lengths (often one nucleotide)
of DNA synthesized.

What emerges from research into cellular DNA repair systems is a picture
of a DNA metabolism that maintains genomic integrity with multiple and
often redundant systems. In the human genome, more than 130 genes encode
proteins dedicated to the repair of DNA. In many cases, the loss of
function of one of these proteins results in genomic instability and an
increased occurrence of oncogenesis (Box 25-1). These repair systems are
often integrated with the DNA replication systems and are complemented
by the recombination systems that we turn to next.

SUMMARY 25.2 DNA Repair 


■ Cells have many systems for DNA repair. 
Mismatch repair in E. coli is directed by 
transient nonmethylation of (5')GATC 
sequences on the newly synthesized strand. 

■ Base-excision repair systems recognize and 
repair damage caused by environmental agents 
(such as radiation and alkylating agents) and 
spontaneous reactions of nucleotides. Some 
repair systems recognize and excise only 
damaged or incorrect bases, leaving an AP 
(abasic) site in the DNA. This is repaired by 
excision and replacement of the DNA segment 
containing the AP site. 

■ Nucleotide-excision repair systems recognize 
and remove a variety of bulky lesions and 
pyrimidine dimers. They excise a segment of 
the DNA strand including the lesion, leaving a 
gap that is filled in by DNA polymerase and 
ligase activities. 

■ Some DNA damage is repaired by direct
reversal of the reaction causing the damage:
pyrimidine dimers are directly converted to
monomeric pyrimidines by a photolyase, and
the methyl group of O⁶-methylguanine is
removed by a methyltransferase.

■ In bacteria, error-prone translesion DNA 
synthesis, involving TLS DNA polymerases, 
occurs in response to very heavy DNA damage. 
In eukaryotes, similar polymerases have 
specialized roles in DNA repair that minimize 
the introduction of mutations. 


25.3 DNA Recombination 

The rearrangement of genetic information within and among DNA molecules
encompasses a variety of processes, collectively placed under the
heading of genetic recombination. The practical applications of DNA
rearrangements in altering the genomes of increasing numbers of
organisms are now being explored (Chapter 9).

Genetic recombination events fall into at least three general classes.
Homologous genetic recombination (also called general recombination)
involves genetic exchanges between any two DNA molecules (or segments of
the same molecule) that share an extended region of nearly identical
sequence. The actual sequence of bases is irrelevant, as long as it is
similar in the two DNAs. In site-specific recombination, the exchanges
occur only at a particular DNA sequence. DNA transposition is distinct
from both other classes in that it usually involves a short segment of
DNA with the remarkable capacity to move from one location in a
chromosome to another. These “jumping genes” were first observed in
maize in the 1940s by Barbara McClintock. There is in addition a wide
range of unusual genetic rearrangements for which no mechanism or
purpose has yet been proposed. Here we focus on the three general
classes.

The functions of genetic recombination systems are as varied as their
mechanisms. They include roles in specialized DNA repair systems,
specialized activities in DNA replication, regulation of expression of
certain genes, facilitation of proper chromosome segregation during
eukaryotic cell division, maintenance of genetic diversity, and
implementation of programmed genetic rearrangements during embryonic
development. In most cases, genetic recombination is closely integrated
with other processes in DNA metabolism, and this becomes a theme of our
discussion.

Homologous Genetic Recombination
Has Several Functions

In bacteria, homologous genetic recombination is primarily a DNA repair
process and in this context (as noted in Section 25.2) is referred to as
recombinational DNA repair. It is usually directed at the reconstruction
of replication forks stalled at the site of DNA damage. Homologous
genetic recombination can also occur during conjugation (mating), when
chromosomal DNA is transferred from a donor to a recipient bacterial
cell. Recombination during conjugation, although rare in wild bacterial
populations, contributes to genetic diversity.

In eukaryotes, homologous genetic recombination can have several roles
in replication and cell division, including the repair of stalled
replication forks. Recombination occurs with the highest frequency
during meiosis, the process by which diploid germ-line cells with two
sets of chromosomes divide to produce haploid gametes (sperm cells or
ova in higher eukaryotes), each gamete having only one member of each
chromosome pair (Fig. 25-29). Meiosis begins with replication of the DNA
in the germ-line cell so that each DNA molecule is present in four
copies. The cell then goes through two rounds of cell division without
an intervening round of DNA replication. This reduces the DNA content to
the haploid level in each gamete.

FIGURE 25-29 Meiosis in eukaryotic germ-line cells. The chromosomes of a
hypothetical diploid germ-line cell (six chromosomes; three homologous
pairs) replicate and are held together at their centromeres. Each
replicated double-stranded DNA molecule is called a chromatid (sister
chromatid). In prophase I, just before the first meiotic division, the
three homologous sets of chromatids align to form tetrads, held together
by covalent links at homologous junctions (chiasmata). Crossovers occur
within the chiasmata (see Fig. 25-30). These transient associations
between homologs ensure that the two tethered chromosomes segregate
properly in the next step, when they migrate toward opposite poles of
the dividing cell in the first meiotic division. The products of this
division are two daughter cells, each with three pairs of chromatids.
The pairs now line up across the equator of the cell in preparation for
separation of the chromatids (now called chromosomes). The second
meiotic division produces four haploid daughter cells that can serve as
gametes. Each has three chromosomes, half the number of the diploid
germ-line cell. The chromosomes have resorted and recombined.

After the DNA is replicated during prophase of the first meiotic
division, the resulting sister chromatids remain associated at their
centromeres. At this stage, each set of four homologous chromosomes
exists as two pairs of chromatids. Genetic information is now exchanged
between the closely associated homologous chromatids by homologous
genetic recombination, a process involving the breakage and rejoining of
DNA (Fig. 25-30). This exchange, also referred to as crossing over, can
be observed with the light microscope. Crossing over links the two pairs
of sister chromatids together at points called chiasmata (singular,
chiasma).

FIGURE 25-30 Crossing over. (a) Crossing over often produces an exchange
of genetic material. (b) The homologous chromosomes of a grasshopper are
shown during prophase I of meiosis. Many points of joining (chiasmata)
are evident between the two homologous pairs of chromatids. These
chiasmata are the physical manifestation of prior homologous
recombination (crossing over) events.


Crossing over effectively links together all four homologous chromatids,
a linkage that is essential to the proper segregation of chromosomes in
the subsequent meiotic cell divisions. Crossing over is not an entirely
random process, and “hot spots” have been identified on many eukaryotic
chromosomes. However, the assumption that crossing over can occur with
equal probability at almost any point along the length of two homologous
chromosomes remains a reasonable approximation in many cases, and it is
this assumption that permits the genetic mapping of genes. The frequency
of homologous recombination in any region separating two points on a
chromosome is roughly proportional to the distance between the points,
and this allows determination of the relative positions of and distances
between different genes.
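
The proportionality between recombination frequency and distance can be turned into a small calculation. The Python sketch below, using hypothetical gene names and frequencies, converts recombinant counts into recombination frequencies and orders three genes by treating the pair with the highest frequency as the outermost pair. Expressing 1% recombination as one map unit is a common convention that is assumed here; it is not stated in the text.

```python
def recombination_frequency(recombinant_offspring, total_offspring):
    """Fraction of offspring in which two markers were separated."""
    if total_offspring <= 0:
        raise ValueError("total_offspring must be positive")
    return recombinant_offspring / total_offspring


def map_units(recombinant_offspring, total_offspring):
    """Approximate map distance, assuming 1% recombination = 1 map unit."""
    return 100.0 * recombination_frequency(recombinant_offspring, total_offspring)


def order_three_genes(pair_frequencies):
    """Order three genes from pairwise recombination frequencies.

    pair_frequencies maps frozenset({gene_x, gene_y}) to a frequency.
    The pair that recombines most often is taken to lie farthest
    apart, so the remaining gene sits between them.
    """
    outer_pair = max(pair_frequencies, key=pair_frequencies.get)
    all_genes = set().union(*pair_frequencies)
    middle_gene = (all_genes - outer_pair).pop()
    first, last = sorted(outer_pair)
    return [first, middle_gene, last]


if __name__ == "__main__":
    # Hypothetical pairwise frequencies for genes A, B, and C.
    frequencies = {
        frozenset("AB"): 0.05,
        frozenset("BC"): 0.10,
        frozenset("AC"): 0.15,
    }
    print(order_three_genes(frequencies))
    print(map_units(50, 1000))
```

The approach works only as well as the equal-probability approximation described above; hot spots and multiple crossovers between distant markers would distort the estimated distances.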

Homologous recombination thus serves at least three identifiable
functions: (1) it contributes to the repair of several types of DNA
damage; (2) it provides, in eukaryotic cells, a transient physical link
between chromatids that promotes the orderly segregation of chromosomes
at the first meiotic cell division; and (3) it enhances genetic
diversity in a population.

Recombination during Meiosis Is Initiated 
with Double-Strand Breaks 

A likely pathway for homologous recombination during meiosis is outlined
in Figure 25-31a. The model has four key features. First, homologous
chromosomes are aligned. Second, a double-strand break in a DNA molecule
is enlarged by an exonuclease, leaving a single-strand extension with a
free 3'-hydroxyl group at the broken end (step (1)). Third, the exposed
3' ends invade the intact duplex DNA, and this is followed by branch
migration (Fig. 25-32) and/or replication to create a pair of crossover
structures, called Holliday junctions (Fig. 25-31a, steps (2) to (4)).
Fourth, cleavage of the two crossovers creates two complete recombinant
products (step (5)).

In this double-strand break repair model for recombination, the 3' ends
are used to initiate the genetic exchange. Once paired with the
complementary strand on the intact homolog, a region of hybrid DNA is
created that contains complementary strands from two different parent
DNAs (the product of step (2) in Fig. 25-31a). Each of the 3' ends can
then act as a primer for DNA replication. The structures thus formed,
Holliday intermediates (Fig. 25-31b), are a feature of homologous
genetic recombination pathways in all organisms.

Homologous recombination can vary in many details from one species to
another, but most of the steps outlined above are generally present in
some form. There are two ways to cleave, or “resolve,” the Holliday
intermediate so that the two recombinant products carry genes in the
same linear order as in the substrates (the original, unrecombined
chromosomes) (step (5) of Fig. 25-31a). If cleaved one way, the DNA
flanking the region containing the hybrid DNA is not recombined; if
cleaved the other way, the flanking DNA is recombined. Both outcomes are
observed in vivo in eukaryotes and prokaryotes.

FIGURE 25-31 Recombination during meiosis. (a) Model of double-strand
break repair for homologous genetic recombination. The two homologous
chromosomes involved in this recombination event have similar sequences.
Each of the two genes shown has different alleles on the two
chromosomes. The DNA strands and alleles are colored differently so that
their fate is evident. The steps are described in the text.
(1) A double-strand break in one of two homologs is converted to a
double-strand gap by the action of exonucleases. Strands with 3' ends
are degraded less than those with 5' ends, producing 3' single-strand
extensions.
(2) An exposed 3' end pairs with its complement in the intact homolog.
The other strand of the duplex is displaced.
(3) The invading 3' end is extended by DNA polymerase plus branch
migration, eventually generating a DNA molecule with two crossovers
called Holliday intermediates.
(4) Further DNA replication replaces the DNA missing from the site of
the original double-strand break.
(5) Cleavage of the Holliday intermediates by specialized nucleases
generates either of the two recombination products (product set 1 or
product set 2). In product set 2, the DNA on either side of the region
undergoing repair is recombined.
(b) A Holliday intermediate formed between two bacterial plasmids in
vivo, as seen with the electron microscope. The intermediates are named
for Robin Holliday, who first proposed their existence in 1964.

FIGURE 25-32 Branch migration. When a template strand pairs with two
different complementary strands, a branch is formed at the point where
the three complementary strands meet. The branch "migrates" when base
pairing to one of the two complementary strands is broken and replaced
with base pairing to the other complementary strand. In the absence of
an enzyme to direct it, this process can move the branch spontaneously
in either direction. Spontaneous branch migration is blocked wherever
one of the otherwise complementary strands has a sequence nonidentical
to the other strand.

The homologous recombination illustrated in Figure 25-31 is a very
elaborate process with subtle molecular consequences for the generation
of genetic diversity. To understand how this process contributes to
diversity, we should keep in mind that the two homologous chromosomes
that undergo recombination are not necessarily identical. The linear
array of genes may be the same, but the base sequences in some of the
genes may differ slightly (in different alleles). In a human, for
example, one chromosome may contain the allele for hemoglobin A (normal
hemoglobin) while the other contains the allele for hemoglobin S (the
sickle-cell mutation). The difference may consist of no more than one
base pair among millions. Homologous recombination does not change the
linear array of genes, but it can determine which alleles become linked
together on a single chromosome.
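
The effect of the two ways of resolving a Holliday intermediate on allele linkage can be sketched in a few lines of Python. In this simplified model each chromosome is a tuple of alleles in a fixed gene order, the short hybrid DNA region is ignored, and the second marker gene ("M" and "m") is hypothetical; only the hemoglobin A and S alleles come from the text.

```python
def exchange_flanks(chrom_1, chrom_2, point):
    """Swap everything to the right of `point` between two homologs."""
    if len(chrom_1) != len(chrom_2):
        raise ValueError("homologs must carry the same number of genes")
    return (chrom_1[:point] + chrom_2[point:],
            chrom_2[:point] + chrom_1[point:])


def resolve(chrom_1, chrom_2, point, flanks_recombined):
    """Return the two products of resolving a Holliday intermediate.

    flanks_recombined=False models product set 1 (flanking DNA not
    recombined); True models product set 2 (flanking DNA recombined).
    """
    if flanks_recombined:
        return exchange_flanks(chrom_1, chrom_2, point)
    return chrom_1, chrom_2


if __name__ == "__main__":
    homolog_1 = ("HbA", "M")  # hemoglobin A allele linked to marker M
    homolog_2 = ("HbS", "m")  # hemoglobin S allele linked to marker m
    print(resolve(homolog_1, homolog_2, point=1, flanks_recombined=False))
    print(resolve(homolog_1, homolog_2, point=1, flanks_recombined=True))
```

In both outcomes the gene order is unchanged; only in the second does the combination of alleles on each chromosome differ from the parental arrangement.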

Recombination Requires a Host of Enzymes 
and Other Proteins 

Enzymes that promote various steps of homologous recombination have been
isolated from both prokaryotes and eukaryotes. In E. coli, the recB,
recC, and recD genes encode the RecBCD enzyme, which has both helicase
and nuclease activities. The RecA protein promotes all the central steps
in the homologous recombination process: the pairing of two DNAs,
formation of Holliday intermediates, and branch migration (as described
below). The RuvA and RuvB proteins (repair of UV damage) form a complex
that binds to Holliday intermediates, displaces RecA protein, and
promotes branch migration at higher rates than does RecA. Nucleases that
specifically cleave Holliday intermediates, often called resolvases,
have been isolated from bacteria and yeast. The RuvC protein is one of
at least two such nucleases in E. coli.

[Figure 25-33 labels: RecBCD enzyme; chi; ATP → ADP + Pi. Helicase and
nuclease activities of RecBCD degrade the DNA. On reaching a chi
sequence, nuclease activity on the strand with the 3' end is suppressed.
The other strand continues to be degraded, generating a 3'-terminal
single-stranded end.]

The RecBCD enzyme binds to linear DNA at a free (broken) end and moves
inward along the double helix, unwinding and degrading the DNA in a
reaction coupled to ATP hydrolysis (Fig. 25-33). The activity of the
enzyme is altered when it interacts with a sequence referred to as chi,
(5')GCTGGTGG. From that point, degradation of the strand with a 3'
terminus is greatly reduced, but degradation of the 5'-terminal strand
is increased. This process creates a single-stranded DNA with a 3' end,
which is used during subsequent steps in recombination (Fig. 25-31). The
1,009 chi sequences scattered throughout the E. coli genome enhance the
frequency of recombination about five- to tenfold within 1,000 bp of the
chi site. The enhancement declines as the distance from the site
increases. Sequences that enhance recombination frequency have also been
identified in several other organisms.
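
Locating chi sequences in a DNA sequence is a simple string search. The Python sketch below is a minimal illustration: it scans one strand, written 5' to 3', for the chi octamer given in the text and for its reverse complement (which marks a chi site on the opposite strand). The example sequence and the function names are made up for illustration.

```python
CHI = "GCTGGTGG"  # chi sequence, written 5' to 3' (from the text)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_chi_sites(seq):
    """Return sorted (position, strand) pairs for every chi site.

    Positions are 0-based offsets on the given strand. "+" means chi
    reads 5' to 3' on the given strand, "-" on the opposite strand.
    """
    seq = seq.upper()
    hits = []
    for strand, motif in (("+", CHI), ("-", reverse_complement(CHI))):
        start = seq.find(motif)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(motif, start + 1)
    return sorted(hits)


if __name__ == "__main__":
    example = "AAGCTGGTGGTTCCACCAGCAA"  # hypothetical sequence
    print(find_chi_sites(example))
```

Because chi is not a palindrome, the strand on which it is found matters: RecBCD responds to it only when approaching from the appropriate direction, so a real analysis would keep track of orientation as this sketch does.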

FIGURE 25-33 Helicase and nuclease activities of the RecBCD enzyme.
Entering at a double-stranded end, RecBCD unwinds and degrades the DNA
until it encounters a chi sequence. The interaction with chi alters the
activity of RecBCD so that it generates a single-stranded DNA with a 3'
end, suitable for subsequent steps in recombination. Movement of the
enzyme requires ATP hydrolysis. This enzyme is believed to help initiate
homologous genetic recombination in E. coli. It is also involved in the
repair of double-strand breaks at collapsed replication forks.

FIGURE 25-34 RecA. (a) Nucleoprotein filament of RecA protein on
single-stranded DNA, as seen with the electron microscope. The
striations indicate the right-handed helical structure of the filament.
(b) Surface contour model of a 24-subunit RecA filament. The filament
has six subunits per turn. One subunit is colored red to provide
perspective (derived from PDB ID 2REB).

[Figure 25-35 annotation: RecA protein binds to single-stranded or
gapped DNA. The complementary strand of the linear DNA pairs with a
circular single strand. The other linear strand is displaced (left) or
pairs with its complement in the circular duplex to yield a Holliday
structure (right).]

RecA is unusual among the proteins of DNA metabolism in that its active
form is an ordered, helical filament of up to several thousand RecA
monomers that assemble cooperatively on DNA (Fig. 25-34). This filament
normally forms on single-stranded DNA, such as that produced by the
RecBCD enzyme. The filament will also form on a duplex DNA with a
single-strand gap; in this case, the first RecA monomers bind to the
single-stranded DNA in the gap, after which the assembled filament
rapidly envelops the neighboring duplex. The RecF, RecO, and RecR
proteins regulate the assembly and disassembly of RecA filaments.

A useful model to illustrate the recombination activities of the RecA
filament is the in vitro DNA strand exchange reaction (Fig. 25-35). A
single strand of DNA is first bound by RecA to establish the
nucleoprotein filament. The RecA filament then takes up a homologous
duplex DNA and aligns it with the bound single strand. Strands are then
exchanged between the two DNAs to create hybrid DNA. The exchange occurs
at a rate of 6 bp/s and progresses in the 5'→3' direction relative to
the single-stranded DNA within the RecA filament. This reaction can
involve either three or four strands (Fig. 25-35); in the latter case, a
Holliday intermediate forms during the process.
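
The quoted rate gives a feel for the time scale of strand exchange. The short Python sketch below assumes that the 6 bp/s rate stays constant along the whole substrate, which is a simplification, and uses an illustrative function name.

```python
STRAND_EXCHANGE_RATE_BP_PER_S = 6.0  # in vitro rate quoted in the text


def strand_exchange_time(length_bp, rate_bp_per_s=STRAND_EXCHANGE_RATE_BP_PER_S):
    """Seconds needed to exchange `length_bp` base pairs at a constant rate."""
    if length_bp < 0 or rate_bp_per_s <= 0:
        raise ValueError("length must be non-negative and rate positive")
    return length_bp / rate_bp_per_s


if __name__ == "__main__":
    for length in (600, 3000, 6000):
        seconds = strand_exchange_time(length)
        print(f"{length} bp: {seconds:.0f} s ({seconds / 60:.1f} min)")
```

On this estimate, exchanging a few thousand base pairs takes on the order of minutes, so RecA-mediated exchange is slow compared with the movement of a replication fork.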

[Figure 25-35 annotation: Continued branch migration yields a circular
duplex with a nick and a displaced linear strand (left) or a partially
single-stranded linear duplex (right).]

FIGURE 25-35 DNA strand-exchange reactions promoted by RecA protein in
vitro. Strand exchange involves the separation of one strand of a duplex
DNA from its complement and transfer of the strand to an alternative
complementary strand to form a new duplex (heteroduplex) DNA. The
transfer forms a branched intermediate. Formation of the final product
depends on branch migration, which is facilitated by RecA. The reaction
can involve three strands (left) or a reciprocal exchange between two
homologous duplexes (four strands in all) (right). When four strands are
involved, the branched intermediate that results is a Holliday
intermediate. RecA protein promotes the branch-migration phases of these
reactions, using energy derived from ATP hydrolysis.

As the duplex DNA is incorporated within the RecA filament and aligned
with the bound single-stranded DNA over regions of hundreds of base
pairs, one strand of the duplex switches pairing partners (Fig. 25-36,
step (2)). Because DNA is a helical structure, continued strand exchange
requires an ordered rotation of the two aligned DNAs. This brings about
a spooling action (steps (3) and (4)) that shifts the branch point along
the helix. ATP is hydrolyzed by RecA protein during this reaction.

FIGURE 25-36 Model for DNA strand exchange mediated by RecA protein. A
three-strand reaction is shown. The balls representing RecA protein are
undersized relative to the thickness of DNA to clarify the fate of the
DNA strands. (1) RecA protein forms a filament on the single-stranded
DNA. (2) A homologous duplex incorporates into this complex. (3) As
spooling shifts the three-stranded region from left to right, one of the
strands in the duplex is transferred to the single strand originally
bound in the filament. The other strand of the duplex is displaced, and
a new duplex forms within the filament. As rotation continues ((4) and
(5)), the displaced strand separates entirely. In this model, hydrolysis
of ATP by RecA protein rotates the two DNA molecules relative to each
other and thus directs the strand exchange from left to right as shown.

Once a Holliday intermediate has formed, a host of enzymes
(topoisomerases, the RuvAB branch migration protein, a resolvase, other
nucleases, DNA polymerase I or III, and DNA ligase) are required to
complete recombination. The RuvC protein (Mr 20,000) of E. coli cleaves
Holliday intermediates to generate full-length, unbranched chromosome
products.

All Aspects of DNA Metabolism Come Together 
to Repair Stalled Replication Forks 

Like all cells, bacteria sustain high levels of DNA damage even under
normal growth conditions. Most DNA lesions are repaired rapidly by
base-excision repair, nucleotide-excision repair, and the other pathways
described earlier. Nevertheless, almost every bacterial replication fork
encounters an unrepaired DNA lesion or break at some point in its
journey from the replication origin to the terminus (Fig. 25-28). DNA
polymerase III cannot proceed past many types of DNA lesions, and these
encounters tend to leave the lesion in a single-strand gap. An encounter
with a DNA strand break creates a double-strand break. Both situations
require recombinational DNA repair (Fig. 25-37). Under normal growth
conditions, stalled replication forks are reactivated by an elaborate
repair pathway encompassing recombinational DNA repair, the restart of
replication, and the repair of any lesions left behind. All aspects of
DNA metabolism come together in this process.

After a replication fork has been halted, it can be restored by at least
two major paths, both of which require the RecA protein. The repair
pathway for lesion-containing DNA gaps also requires the RecF, RecO, and
RecR proteins. Repair of double-strand breaks requires the RecBCD enzyme
(Fig. 25-37). Additional recombination steps are followed by a process
called origin-independent restart of replication, in which the
replication fork reassembles with the aid of a complex of seven proteins
(PriA, B, and C, and DnaB, C, G, and T). This complex, originally
discovered as a component required for the replication of φX174 DNA in
vitro, is now termed the replication restart primosome. Restart of the
replication fork also requires DNA polymerase II, in a role not yet
defined; this polymerase II activity gives way to DNA polymerase III for
the extensive replication generally required to complete the chromosome.

The repair of stalled replication forks entails a coordinated transition
from replication to recombination and back to replication. The
recombination steps function to fill the DNA gap or rejoin the broken
DNA branch to recreate the branched DNA structure at the replication
fork. Lesions left behind in what is now duplex DNA are repaired by
pathways such as base-excision or nucleotide-excision repair. Thus a
wide range of enzymes encompassing every aspect of DNA metabolism
ultimately take part in the repair of a stalled replication fork. This
type of repair process is clearly a primary function of the homologous
recombination system of every cell, and defects in recombinational DNA
repair play an important role in human disease (Box 25-1).

Site-Specific Recombination Results in Precise 
DNA Rearrangements 

Homologous genetic recombination can involve any two homologous
sequences. The second general type of recombination, site-specific
recombination, is a very different type of process: recombination is
limited to specific sequences. Recombination reactions of this type
occur in virtually every cell, filling specialized roles that vary
greatly from one species to another. Examples include regulation of the
expression of certain genes and promotion of programmed DNA
rearrangements in embryonic development or in the replication cycles of
some viral and plasmid DNAs. Each site-specific recombination system
consists of an enzyme called a recombinase and a short (20 to 200 bp),
unique DNA sequence where the recombinase acts (the recombination site).
One or more auxiliary proteins may regulate the timing or outcome of the
reaction.

FIGURE 25-37 Models for recombinational DNA repair of stalled
replication forks. The replication fork collapses on encountering a DNA
lesion (left) or strand break (right). Recombination enzymes promote the
DNA strand transfers needed to repair the branched DNA structure at the
replication fork. A lesion in a single-strand gap is repaired in a
reaction requiring the RecF, RecO, and RecR proteins. Double-strand
breaks are repaired in a pathway requiring the RecBCD enzyme. Both
pathways require RecA. Recombination intermediates are processed by
additional enzymes (e.g., RuvA, RuvB, and RuvC, which process Holliday
intermediates). Lesions in double-stranded DNA are repaired by
nucleotide-excision repair or other pathways. The replication fork
re-forms with the aid of enzymes catalyzing origin-independent
replication restart, and chromosomal replication is completed. The
overall process requires an elaborate coordination of all aspects of
bacterial DNA metabolism.

FIGURE 25-38 A site-specific recombination reaction. (a) The reaction
shown here is for a common class of site-specific recombinases called
integrase-class recombinases (named after bacteriophage λ integrase, the
first recombinase characterized). The reaction is carried out within a
tetramer of identical subunits. Recombinase subunits bind to a specific
sequence, often called simply the recombination site. (1) One strand in
each DNA is cleaved at particular points within the sequence. The
nucleophile is the OH group of an active-site Tyr residue, and the
product is a covalent phosphotyrosine link between protein and DNA.
(2) The cleaved strands join to new partners, producing a Holliday
intermediate. Steps (3) and (4) complete the reaction by a process
similar to the first two steps. The original sequence of the
recombination site is regenerated after recombining the DNA flanking the
site. These steps occur within a complex of multiple recombinase
subunits that sometimes includes other proteins not shown here. (b) A
surface contour model of a four-subunit integrase-class recombinase
called the Cre recombinase, bound to a Holliday intermediate (shown with
light blue and dark blue helix strands). The protein has been rendered
transparent so that the bound DNA is visible (derived from PDB ID 3CRX).

In vitro studies of many site-specific recombination systems have
elucidated some general principles, including the fundamental reaction
pathway (Fig. 25-38a). A separate recombinase recognizes and binds to
each of two recombination sites on two different DNA molecules or within
the same DNA. One DNA strand in each site is cleaved at a specific point
within the site, and the recombinase becomes covalently linked to the
DNA at the cleavage site through a phosphotyrosine (or phosphoserine)
bond (step (1)). The transient protein-DNA linkage preserves the
phosphodiester bond that is lost in cleaving the DNA, so high-energy
cofactors such as ATP are unnecessary in subsequent steps. The cleaved
DNA strands are rejoined to new partners to form a Holliday
intermediate, with new phosphodiester bonds created at the expense of
the protein-DNA linkage (step (2)). To complete the reaction, the
process must be repeated at a second point within each of the two
recombination sites (steps (3) and (4)). In some systems, both strands
of each recombination site are cut concurrently and rejoined to new
partners without the Holliday intermediate. The exchange is always
reciprocal and precise, regenerating the recombination sites when the
reaction is complete. We can view a recombinase as a site-specific
endonuclease and ligase in one package.

The sequences of the recombination sites recognized 
by site-specific recombinases are partially asymmetric 
(nonpalindromic), and the two recombining sites align in 
the same orientation during the recombinase reaction. 
The outcome depends on the location and orientation of 
the recombination sites (Fig. 25-39). If the two sites are 
on the same DNA molecule, the reaction either inverts 
or deletes the intervening DNA, determined by whether 
the recombination sites have the opposite or the same 
FIGURE 25-39 Effects of site-specific recombination. The outcome
of site-specific recombination depends on the location and orientation
of the recombination sites (red and green) in a double-stranded DNA
molecule. Orientation here (shown by arrowheads) refers to the order
of nucleotides in the recombination site, not the 5'→3' direction.
(a) Recombination sites with opposite orientation in the same DNA
molecule. The result is an inversion. (b) Recombination sites with the
same orientation, either on one DNA molecule, producing a deletion,
or on two DNA molecules, producing an insertion.


orientation, respectively. If the sites are on different 
DNAs, the recombination is intermolecular; if one or both 
DNAs are circular, the result is an insertion. Some recombinase
systems are highly specific for one of these
reaction types and act only on sites with particular 
orientations. 
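
The dependence of the outcome on site orientation can be illustrated with a minimal Python sketch. It is a toy model under stated assumptions, not a description of any real recombinase: DNA is a plain string for one strand, the recombination site `GATTC` is a made-up asymmetric sequence, and the crossover is treated as happening at the site boundary rather than within the site. The function names `reverse_complement` and `recombine` are hypothetical.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (uppercase A/C/G/T)."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq))


def recombine(dna: str, site: str) -> tuple[str, str]:
    """Apply a toy site-specific recombination to one linear DNA molecule.

    The first copy of `site` must be in the forward orientation. The second
    copy may be forward (same orientation) or reverse-complemented
    (opposite orientation). Returns (outcome, product).
    """
    first = dna.find(site)
    if first < 0:
        raise ValueError("no recombination site found")
    after_first = first + len(site)

    same = dna.find(site, after_first)
    opposite = dna.find(reverse_complement(site), after_first)

    if same >= 0:
        # Same orientation: the intervening DNA (plus one site copy) is
        # deleted from the molecule, leaving a single site behind.
        return "deletion", dna[:after_first] + dna[same + len(site):]
    if opposite >= 0:
        # Opposite orientation: the intervening DNA is inverted in place.
        between = dna[after_first:opposite]
        return "inversion", dna[:after_first] + reverse_complement(between) + dna[opposite:]
    raise ValueError("second recombination site not found")


SITE = "GATTC"  # hypothetical asymmetric (nonpalindromic) site

# Two sites in the same orientation flanking ACGT.
direct = "TTT" + SITE + "ACGT" + SITE + "GGG"
# Second site in the opposite orientation, flanking AACG.
inverted = "TTT" + SITE + "AACG" + reverse_complement(SITE) + "GGG"

print(recombine(direct, SITE))    # ('deletion', 'TTTGATTCGGG')
print(recombine(inverted, SITE))  # ('inversion', 'TTTGATTCCGTTGAATCGGG')
```

In the deletion case the excised segment would leave as a circle carrying the other site copy; the sketch returns only the remaining linear product.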

The first site-specific recombination system studied
in vitro was that encoded by bacteriophage λ. When λ
phage DNA enters an E. coli cell, a complex series of
regulatory events commits the DNA to one of two fates.
The λ DNA either replicates and produces more bacteriophages
(destroying the host cell) or integrates into
the host chromosome, replicating passively along with
the chromosome for many cell generations. Integration
is accomplished by a phage-encoded recombinase (λ integrase)
that acts at recombination sites on the phage
and bacterial DNAs, the attachment sites attP and attB,
respectively (Fig. 25-40). The role of site-specific recombination
in regulating gene expression is considered
in Chapter 28.

FIGURE 25-40 Integration and excision of bacteriophage λ DNA at
the chromosomal target site. The attachment site on the λ phage DNA
(attP) shares only 15 bp of complete homology with the bacterial site
(attB) in the region of the crossover. The reaction generates two new
attachment sites (attR and attL) flanking the integrated phage DNA.
The recombinase is the λ integrase (or INT protein). Integration and
excision use different attachment sites and different auxiliary proteins.
Excision uses the proteins XIS, encoded by the bacteriophage, and FIS,
encoded by the bacterium. Both reactions require the protein IHF (integration
host factor), encoded by the bacterium.

Complete Chromosome Replication Can Require
Site-Specific Recombination 

Recombinational DNA repair of a circular bacterial chro¬ 
mosome, while essential, sometimes generates deleteri¬ 
ous byproducts. The resolution of a Holliday junction at 
a replication fork by a nuclease such as RuvC, followed 
by completion of replication, can give rise to one of two 
products: the usual two monomeric chromosomes or a 
contiguous dimeric chromosome (Fig. 25-41). In the lat¬ 
ter case, the covalently linked chromosomes cannot be 
segregated to daughter cells at cell division and the di¬ 
viding cells become “stuck.” A specialized site-specific 
recombination system in E. coli, the XerCD system, con¬ 
verts the dimeric chromosomes to monomeric chromo¬ 
somes so that cell division can proceed. The reaction is 
a site-specific deletion reaction (Fig. 25-39b). This is 
another example of the close coordination between DNA 
recombination processes and other aspects of DNA 
metabolism. 
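
Why a deletion reaction between two sites in the same orientation turns one dimeric circle into two monomeric circles can be seen in a small Python sketch. This is only an illustration under assumptions: the chromosome is reduced to a list of labels read around the circle, and the labels `site`, `geneA`, and `geneB` and the function name `resolve_dimer` are hypothetical.

```python
def resolve_dimer(circle: list[str], site: str = "site") -> tuple[list[str], list[str]]:
    """Split a circular dimer at its two recombination sites.

    `circle` lists markers in order around a circular DNA; the last element
    is joined to the first. Deletion between two sites in the same
    orientation releases the DNA between them as a second circle, so each
    product circle keeps exactly one copy of the site.
    """
    positions = [i for i, label in enumerate(circle) if label == site]
    if len(positions) != 2:
        raise ValueError("expected exactly two recombination sites")
    i, j = positions
    inner = circle[i:j]              # first site plus DNA up to the second site
    outer = circle[j:] + circle[:i]  # second site plus the rest, wrapping around
    return inner, outer


# A dimer: two head-to-tail copies of the same monomer in one circle.
dimer = ["site", "geneA", "geneB", "site", "geneA", "geneB"]
first, second = resolve_dimer(dimer)
print(first)   # ['site', 'geneA', 'geneB']
print(second)  # ['site', 'geneA', 'geneB']
```

Because the list represents a circle, the starting point is arbitrary; rotating the input gives the same two monomers.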


Transposable Genetic Elements Move from One 
Location to Another 

We now consider the third general type of recombina¬ 
tion system: recombination that allows the movement 
of transposable elements, or transposons. These seg¬ 
ments of DNA, found in virtually all cells, move, or 
“jump,” from one place on a chromosome (the donor 
site) to another on the same or a different chromosome 
(the target site). DNA sequence homology is not usu¬ 
ally required for this movement, called transposition; 
the new location is determined more or less randomly. 
Insertion of a transposon in an essential gene could kill 
the cell, so transposition is tightly regulated and usually 
very infrequent. Transposons are perhaps the simplest 
of molecular parasites, adapted to replicate passively 
within the chromosomes of host cells. In some cases 
they carry genes that are useful to the host cell, and 
thus exist in a kind of symbiosis with the host. 

Bacteria have two classes of transposons. Insertion 
sequences (simple transposons) contain only the se¬ 
quences required for transposition and the genes for 
proteins (transposases) that promote the process. 
Complex transposons contain one or more genes in 
addition to those needed for transposition. These extra 
genes might, for example, confer resistance to antibi¬ 
otics and thus enhance the survival chances of the host 
cell. The spread of antibiotic-resistance elements among 
disease-causing bacterial populations that is rendering 
some antibiotics ineffectual (pp. 925-926) is mediated 
in part by transposition. 

Bacterial transposons vary in structure, but most 
have short repeated sequences at each end that serve 
as binding sites for the transposase. When transposition 
occurs, a short sequence at the target site (5 to 10 bp) 
is duplicated to form an additional short repeated se¬ 
quence that flanks each end of the inserted transposon 
(Fig. 25-42). These duplicated segments result from the 
cutting mechanism used to insert a transposon into the 
DNA at a new location. 
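
How a staggered cut leads to a duplicated target sequence can be sketched in a few lines of Python. This is a simplified model under assumptions: only the top strand of the product is tracked, the gap fill-in is represented by copying the stretch between the two cuts, and the function name `insert_transposon` and the placeholder `[Tn]` are hypothetical.

```python
def insert_transposon(target: str, cut: int, stagger: int, transposon: str) -> str:
    """Return the top strand after inserting a transposon at a staggered cut.

    The two cuts bracket target[cut:cut + stagger]. Once the transposon is
    joined and the single-stranded gaps are filled in, that stretch is
    present on both sides of the inserted element.
    """
    if cut < 0 or stagger < 0 or cut + stagger > len(target):
        raise ValueError("cut window lies outside the target sequence")
    duplicated = target[cut:cut + stagger]
    return target[:cut] + duplicated + transposon + duplicated + target[cut + stagger:]


# A 5 bp stagger (within the 5 to 10 bp range given in the text).
product = insert_transposon("AAACCGTAGGG", cut=3, stagger=5, transposon="[Tn]")
print(product)  # AAACCGTA[Tn]CCGTAGGG
```

The product is longer than the target by the length of the transposon plus the length of the stagger, which is the short flanking repeat described above.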

There are two general pathways for transposition in 
bacteria. In direct or simple transposition (Fig. 25-43, 
left), cuts on each side of the transposon excise it, and 
the transposon moves to a new location. This leaves a 
double-strand break in the donor DNA that must be 


FIGURE 25-41 DNA deletion to undo a deleterious effect of recombinational
DNA repair. The resolution of a Holliday intermediate
during recombinational DNA repair (if cut at the points indicated by
red arrows) can generate a contiguous dimeric chromosome. A specialized
site-specific recombinase in E. coli, XerCD, converts the dimer
to monomers, allowing chromosome segregation and cell division to
proceed.

FIGURE 25-42 Duplication of the DNA sequence at a target site
when a transposon is inserted. The duplicated sequences are shown
in red. These sequences are generally only a few base pairs long, so
their size (compared with that of a typical transposon) is greatly exaggerated
in this drawing.



repaired. At the target site, a staggered cut is made (as 
in Fig. 25-42), the transposon is inserted into the break, 
and DNA replication fills in the gaps to duplicate the 
target site sequence. In replicative transposition (Fig. 
25-43, right), the entire transposon is replicated, leav¬ 
ing a copy behind at the donor location. A cointegrate 
is an intermediate in this process, consisting of the 
donor region covalently linked to DNA at the target site. 
Two complete copies of the transposon are present in 
the cointegrate, both having the same relative orientation
in the DNA. In some well-characterized transposons,
the cointegrate intermediate is converted to
products by site-specific recombination, in which spe¬ 
cialized recombinases promote the required deletion 
reaction. 

FIGURE 25-43 Two general pathways for transposition: direct
(simple) and replicative. (1) The DNA is first cleaved on each side of
the transposon, at the sites indicated by arrows. (2) The liberated 3'-hydroxyl
groups at the ends of the transposon act as nucleophiles in
a direct attack on phosphodiester bonds in the target DNA. The target
phosphodiester bonds are staggered (not directly across from each
other) in the two DNA strands. (3) The transposon is now linked to
the target DNA. In direct transposition, replication fills in gaps at each
end. In replicative transposition, the entire transposon is replicated to
create a cointegrate intermediate. (4) The cointegrate is often resolved
later, with the aid of a separate site-specific recombination system.
The cleaved host DNA left behind after direct transposition is either
repaired by DNA end-joining or degraded (not shown). The latter
outcome can be lethal to an organism.

FIGURE 25-44 Recombination of the V and J
gene segments of the human IgG kappa light
chain. This process is designed to generate
antibody diversity. At the top is shown the
arrangement of IgG-coding sequences in a bone
marrow stem cell. Recombination deletes the DNA
between a particular V segment and a J segment. After
transcription, the transcript is processed by RNA splicing,
as described in Chapter 26; translation produces the
light-chain polypeptide. The light chain can combine
with any of 5,000 possible heavy chains to produce an
antibody molecule.

Eukaryotes also have transposons, structurally similar
to bacterial transposons, and some use similar transposition
mechanisms. In other cases, however, the
mechanism of transposition appears to involve an RNA 
intermediate. Evolution of these transposons is inter¬ 
twined with the evolution of certain classes of RNA 
viruses. Both are described in the next chapter. 

Immunoglobulin Genes Assemble by Recombination 

Some DNA rearrangements are a programmed part of 
development in eukaryotic organisms. An important ex¬ 
ample is the generation of complete immunoglobulin 
genes from separate gene segments in vertebrate 
genomes. A human (like other mammals) is capable of 
producing millions of different immunoglobulins (anti¬ 
bodies) with distinct binding specificities, even though 
the human genome contains only ~35,000 genes. Recombination
allows an organism to produce an extraordinary
diversity of antibodies from a limited DNA-coding 
capacity. Studies of the recombination mechanism 
reveal a close relationship to DNA transposition and 
suggest that this system for generating antibody diver¬ 
sity may have evolved from an ancient cellular invasion 
of transposons. 

We can use the human genes that encode proteins 
of the immunoglobulin G (IgG) class to illustrate how 
antibody diversity is generated. Immunoglobulins con¬ 
sist of two heavy and two light polypeptide chains (see 
Fig. 5-23). Each chain has two regions, a variable re¬ 
gion, with a sequence that differs greatly from one im¬ 
munoglobulin to another, and a region that is virtually 
constant within a class of immunoglobulins. There are 
also two distinct families of light chains, kappa and 
lambda, which differ somewhat in the sequences of their 
constant regions. For all three types of polypeptide 
chain (heavy chain, and kappa and lambda light chains), 
diversity in the variable regions is generated by a similar
mechanism. The genes for these polypeptides are divided
into segments, and the genome contains clusters
with multiple versions of each segment. The joining of 
one version of each of the segments creates a complete 
gene. 

Figure 25-44 depicts the organization of the DNA 
encoding the kappa light chains of human IgG and shows 
how a mature kappa light chain is generated. In undif¬ 
ferentiated cells, the coding information for this 
polypeptide chain is separated into three segments. The 
V (variable) segment encodes the first 95 amino acid 
residues of the variable region, the J (joining) segment 
encodes the remaining 12 residues of the variable re¬ 
gion, and the C segment encodes the constant region. 
The genome contains ~300 different V segments, 4 different
J segments, and 1 C segment.

As a stem cell in the bone marrow differentiates to 
form a mature B lymphocyte, one V segment and one J 
segment are brought together by a specialized recom¬ 
bination system (Fig. 25-44). During this programmed 
DNA deletion, the intervening DNA is discarded. There 
are about 300 × 4 = 1,200 possible V-J combinations.
The recombination process is not as precise as the site-specific
recombination described earlier, so additional
variation occurs in the sequence at the V-J junction. This
increases the overall variation by a factor of at least 2.5;
thus the cells can generate about 2.5 × 1,200 = 3,000
different V-J combinations. The final joining of the V-J
combination to the C region is accomplished by an RNA-splicing
reaction after transcription, a process described
in Chapter 26.

The recombination mechanism for joining the V and 
J segments is illustrated in Figure 25-45. Just beyond 
each V segment and just before each J segment lie re¬ 
combination signal sequences (RSS). These are bound 
by proteins called RAG1 and RAG2 (recombination ac¬ 
tivating gene). The RAG proteins catalyze the formation 
of a double-strand break between the signal sequences 
and the V (or J) segments to be joined. The V and J 
segments are then joined with the aid of a second com¬ 
plex of proteins. 



FIGURE 25-45 Mechanism of immunoglobulin gene rearrangement.
The RAG1 and RAG2 proteins bind to the recombination signal sequences
(RSS) and cleave one DNA strand between the RSS and the
V (or J) segments to be joined. The liberated 3' hydroxyl then acts as
a nucleophile, attacking a phosphodiester bond in the other strand to
create a double-strand break. The resulting hairpin bends on the V and
J segments are cleaved, and the ends are covalently linked by a complex
of proteins specialized for end-joining repair of double-strand
breaks. The steps in the generation of the double-strand break catalyzed
by RAG1 and RAG2 are chemically related to steps in transposition
reactions.


The genes for the heavy chains and the lambda light 
chains form by similar processes. Heavy chains have more 
gene segments than light chains, with more than 5,000 
possible combinations. Because any heavy chain can combine
with any light chain to generate an immunoglobulin,
each human has at least 3,000 × 5,000 = 1.5 × 10⁷
possible IgGs. Additional diversity is generated by
high mutation rates (of unknown mechanism) in the 
V sequences during B-lymphocyte differentiation. Each 
mature B lymphocyte produces only one type of anti¬ 
body, but the range of antibodies produced by different 
cells is clearly enormous. 
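
The diversity estimate can be reproduced as a short calculation. The Python sketch below uses only the approximate figures quoted in the text (about 300 V segments, 4 J segments, a junctional factor of at least 2.5, and more than 5,000 heavy-chain combinations); the constant and function names are hypothetical, and the result is a lower-bound estimate rather than a measured value.

```python
V_SEGMENTS = 300               # approximate number of kappa V segments
J_SEGMENTS = 4                 # number of kappa J segments
JUNCTION_FACTOR = 2.5          # extra variation from imprecise V-J joining
HEAVY_CHAIN_COMBINATIONS = 5_000


def antibody_diversity() -> dict[str, int]:
    """Return the stepwise diversity estimate described in the text."""
    vj_pairs = V_SEGMENTS * J_SEGMENTS                 # 300 x 4
    kappa_chains = int(vj_pairs * JUNCTION_FACTOR)     # 2.5 x 1,200
    total_igg = kappa_chains * HEAVY_CHAIN_COMBINATIONS
    return {
        "vj_pairs": vj_pairs,
        "kappa_chains": kappa_chains,
        "total_igg": total_igg,
    }


for name, value in antibody_diversity().items():
    print(f"{name}: {value:,}")
# vj_pairs: 1,200
# kappa_chains: 3,000
# total_igg: 15,000,000
```

The estimate covers kappa light chains only and leaves out the additional diversity from mutation of the V sequences, so the true number of possible antibodies is larger.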

Did the immune system evolve in part from ancient 
transposons? The mechanism for generation of the 
double-strand breaks by RAG1 and RAG2 does mirror 
several reaction steps in transposition (Fig. 25-45). In 
addition, the deleted DNA, with its terminal RSS, has a 
sequence structure found in most transposons. In the test 
tube, RAG1 and RAG2 can associate with this deleted 
DNA and insert it, transposonlike, into other DNA 
molecules (probably a rare reaction in B lymphocytes). 
Although we cannot know for certain, the properties of 
the immunoglobulin gene rearrangement system suggest 
an intriguing origin in which the distinction between 
host and parasite has become blurred by evolution. 

SUMMARY 25.3 DNA Recombination 


■ DNA sequences are rearranged in recombina¬ 
tion reactions, usually in processes tightly 
coordinated with DNA replication or repair. 

■ Homologous genetic recombination can take 
place between any two DNA molecules that 
share sequence homology. In meiosis (in 
eukaryotes), this type of recombination helps 
to ensure accurate chromosomal segregation 
and create genetic diversity. In both bacteria 
and eukaryotes it serves in the repair of stalled 
replication forks. A Holliday intermediate forms 
during homologous recombination. 

■ Site-specific recombination occurs only at 
specific target sequences, and this process can 
also involve a Holliday intermediate. 
Recombinases cleave the DNA at specific 
points and ligate the strands to new partners. 
This type of recombination is found in virtually 
all cells, and its many functions include DNA 
integration and regulation of gene expression. 

■ In virtually all cells, transposons use 
recombination to move within or between 
chromosomes. In vertebrates, a programmed 
recombination reaction related to transposition 
joins immunoglobulin gene segments to form 
immunoglobulin genes during B-lymphocyte 
differentiation. 

Problems


1. Conclusions from the Meselson-Stahl Experiment 

The Meselson-Stahl experiment (see Fig. 25-2) proved that
DNA undergoes semiconservative replication in E. coli. In the
“dispersive” model of DNA replication, the parent DNA 
strands are cleaved into pieces of random size, then joined 
with pieces of newly replicated DNA to yield daughter du¬ 
plexes. In the Meselson-Stahl experiment, each strand would 
contain random segments of heavy and light DNA. Explain 
how the results of Meselson and Stahl’s experiment ruled out 
such a model. 

2. Heavy Isotope Analysis of DNA Replication A culture
of E. coli growing in a medium containing ¹⁵NH₄Cl is
switched to a medium containing ¹⁴NH₄Cl for three generations
(an eightfold increase in population). What is the molar
ratio of hybrid DNA (¹⁵N–¹⁴N) to light DNA (¹⁴N–¹⁴N) at
this point?

3. Replication of the E. coli Chromosome The E. coli 
chromosome contains 4,639,221 bp. 

(a) How many turns of the double helix must be un¬ 
wound during replication of the E. coli chromosome? 

(b) From the data in this chapter, how long would it take
to replicate the E. coli chromosome at 37 °C if two replication
forks proceeded from the origin? Assume replication occurs
at a rate of 1,000 bp/s. Under some conditions E. coli
cells can divide every 20 min. How might this be possible?

(c) In the replication of the E. coli chromosome, about 
how many Okazaki fragments would be formed? What factors 
guarantee that the numerous Okazaki fragments are assem¬ 
bled in the correct order in the new DNA? 

4. Base Composition of DNAs Made from Single- 
Stranded Templates Predict the base composition of the 
total DNA synthesized by DNA polymerase on templates pro¬ 
vided by an equimolar mixture of the two complementary 
strands of bacteriophage φX174 DNA (a circular DNA molecule).
The base composition of one strand is A, 24.7%; G,
24.1%; C, 18.5%; and T, 32.7%. What assumption is neces¬ 
sary to answer this problem? 

5. DNA Replication Kornberg and his colleagues incubated
soluble extracts of E. coli with a mixture of dATP, dTTP,
dGTP, and dCTP, all labeled with ³²P in the α-phosphate
group. After a time, the incubation mixture was treated with
trichloroacetic acid, which precipitates the DNA but not the
nucleotide precursors. The precipitate was collected, and the
extent of precursor incorporation into DNA was determined
from the amount of radioactivity present in the precipitate.

(a) If any one of the four nucleotide precursors were 
omitted from the incubation mixture, would radioactivity be 
found in the precipitate? Explain. 

(b) Would ³²P be incorporated into the DNA if only
dTTP were labeled? Explain.

(c) Would radioactivity be found in the precipitate if ³²P
labeled the β or γ phosphate rather than the α phosphate of
the deoxyribonucleotides? Explain.

6. Leading and Lagging Strands Prepare a table that 
lists the names and compares the functions of the precursors, 
enzymes, and other proteins needed to make the leading ver¬ 
sus lagging strands during DNA replication in E. coli. 

7. Function of DNA Ligase Some E. coli mutants contain
defective DNA ligase. When these mutants are exposed
to ³H-labeled thymine and the DNA produced is sedimented
on an alkaline sucrose density gradient, two radioactive bands 
appear. One corresponds to a high molecular weight fraction, 
the other to a low molecular weight fraction. Explain. 

8. Fidelity of Replication of DNA What factors pro¬ 
mote the fidelity of replication during the synthesis of the 
leading strand of DNA? Would you expect the lagging strand 
to be made with the same fidelity? Give reasons for your 
answers. 

9. Importance of DNA Topoisomerases in DNA Repli¬ 
cation DNA unwinding, such as that occurring in replica¬ 
tion, affects the superhelical density of DNA. In the absence 
of topoisomerases, the DNA would become overwound ahead 
of a replication fork as the DNA is unwound behind it. A bacterial
replication fork will stall when the superhelical density
(σ) of the DNA ahead of the fork reaches +0.14 (see Chapter
24).

Bidirectional replication is initiated at the origin of a 
6,000 bp plasmid in vitro, in the absence of topoisomerases. 
The plasmid initially has a σ of −0.06. How many base pairs
will be unwound and replicated by each replication fork be¬ 
fore the forks stall? Assume that each fork travels at the same 
rate and that each includes all components necessary for 
elongation except topoisomerase. 


10. The Ames Test In a nutrient medium that lacks histidine,
a thin layer of agar containing ~10⁹ Salmonella typhimurium
histidine auxotrophs (mutant cells that require
histidine to survive) produces ~13 colonies over a two-day
incubation period at 37 °C (see Fig. 25-19). How do these
colonies arise in the absence of histidine? The experiment is
repeated in the presence of 0.4 μg of 2-aminoanthracene. The
number of colonies produced over two days exceeds 10,000. 
What does this indicate about 2-aminoanthracene? What can 
you surmise about its carcinogenicity? 

11. DNA Repair Mechanisms Vertebrate and plant cells 
often methylate cytosine in DNA to form 5-methylcytosine 
(see Fig. 8-5a). In these same cells, a specialized repair system
recognizes G–T mismatches and repairs them to G≡C base
pairs. How might this repair system be advantageous to the 
cell? (Explain in terms of the presence of 5-methylcytosine in 
the DNA.) 


12. DNA Repair in People with Xeroderma Pigmentosum
The condition known as xeroderma pigmentosum
(XP) arises from mutations in at least seven different
human genes. The deficiencies are generally in genes
encoding enzymes involved in some part of the pathway for 
human nucleotide-excision repair. The various types of XP 
are labeled A through G (XPA, XPB, etc.), with a few addi¬ 
tional variants lumped under the label XPV. 

Cultures of cells from healthy individuals and from pa¬ 
tients with XPG are irradiated with ultraviolet light. The DNA 
is isolated and denatured, and the resulting single-stranded 
DNA is characterized by analytical ultracentrifugation. 

(a) Samples from the normal fibroblasts show a signifi¬ 
cant reduction in the average molecular weight of the single- 
stranded DNA after irradiation, but samples from the XPG fi¬ 
broblasts show no such reduction. Why might this be? 

(b) If you assume that a nucleotide-excision repair sys¬ 
tem is operative, which step might be defective in the fi¬ 
broblasts from the patients with XPG? Explain. 


13. Holliday Intermediates How does the formation of 
Holliday intermediates in homologous genetic recombination 
differ from their formation in site-specific recombination? 




















